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Title Of The Invention 

NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
STAPHYLOCOCCUS EPIDERMIDIS FOR DIAGNOSTICS AND THERAPEUTICS 

15 

Related Applications 

This application is a Continuation-in-Part which claims priority to U.S. Application 

No, 09/134,001, filed August 13, 1998, which claims the benefit of both U.S. Provisional 

Application No. 60/064,964, filed November 8, 1997 and U.S. Provisional Application No. 

20 60/055,779, filed August 14, 1997, the contents of all of which arejncorporated in their 

entirety. 

Background Of The Invention 

Staphylococcus epidermidis (S. epidermidis) is a species of staphylococcal bacteria 

25 that are Gram-positive, nonmotile, nonpigmented and coagulase-negative cocci, which are 
mainly found on the skin and mucous membrane of warm-blooded animals. Their large 
numbers and ubiquitous distribution result in frequent contamination of specimens collected 
from or through the skin, making these organisms amongst the most frequently isolated in the 
clinical laboratory. In the past, S. epidermidis was rarely the cause of significant infections, 

30 but with the increasing use of implanted catheters and prosthetic devices, it has emerged as an 
important agent of hospital-acquired infections and has been recognized as a true pathogen 
(Lowy and Hammer, 1983, Ann Intern Med, 99: 834-9; Blum and Rodvold, 1987, Clin 
Pharm, 6: 464-75; Hamory, Parisi et al., 1987, Am J Infect Control, 15: 59-74). S. 
epidermidis is a major cause of infection of indwelling foreign devices such as, orthopedic 

35 devices, intravenous catheters, prosthetic heart valves, central nervous system shunts, and 

peritoneal dialysis catheters (Blum and Rodvold, 1987, Clin Pharm, 6: 464-75; Archer, 1988, 
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J Antimicrob Chemother, 21 Suppl C : 133-8)(Lowy and Hammer, 1983, Ann Intern Med, 99: 
834-9; Hamory, Parisi et al, Staphylococcus 1987, Am J Infect Control, 15: 59-74). In 
addition S. epidermidis is a common cause of postoperative wound infections, bacteremia of 
immunosuppressed patients, intensive-care unit patients and premature newborns (MacLowry, 
5 1983, Am J Med, 75: 2-6)(Eykyn, 1988, Lancet, 1: 100-4). According to a national survey 
(Centers for Disease Control, 1981:7) S. epidermidis caused 8.9% of primary nosocomial 
bacteremias. 

Treatment of S. epidermidis infections remains difficult because of the occult nature, 
association with foreign bodies, and frequent resistance to antimicrobial agents. Ordinarily, S. 

10 epidermidis is an organism with low virulence, however breaks in host defense caused by 
surgery, catheter placement, prosthesis insertion or immuno-suppression is prerequisite for 
infection. The presence of foreign bodies itself facilitates infection by protecting the organism 
from elimination by host defenses or antimicrobial therapy (Lowy and Hammer, 1983, Ann 
Intern Med, 99: 834-9). Furthermore, S. epidermidis due to its ability to produce extracellular 

1 5 polysaccharide material or slime, may be uniquely adapted to adhere to smooth surfaces such 
as plastics or metal. Slime producing strains of S. epidermidis appear to be more pathogenic 
than non-slime producing strains (Christensen, Simpson et al., 1983, Infect Immun, 40: 407- 
10; Peters and Pulverer, 1984, J Antimicrob Chemother, 14 Suppl D : 67-71; Gallimore, 
Gagnon et al., 1991, J Infect Dis, 164 : 1220-3). This property and many factors are involved 

20 in the pathogenesis of device associated infections. Despite the increased recognition as a 
pathogen, S. epidermidis infections are difficult to diagnose. Differentiating clinically 
important from clinically unimportant bacterial isolates of S. epidermidis is difficult because 
of the high rate of contamination. 

Although laboratory isolates of S. epidermidis have generally been susceptible to 

25 semisynthetic penicillins (methicillin, nafcillin, oxacillin), cephalosporins, amino-glycosides, 
vancomycin and rafampin, recent clinical isolates have had an increased resistance. Recent 
reports (Karchmer, 1985, Am J Med, 28: 1 16-27; Karchmer, 1991, J Hosp Infect, 18 Suppl 
A: 355-66) show that 83% of S. epidermidis isolates from patients with prosthetic valve 
endocarditis are methicillin resistant and 32% are gentamicin resistant as well. Multi-drug 

30 resistant staphylococci have emerged in the midst of high level use of penicillin and 
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aminoglycosides (Centers for Disease Control and Prevention, 1993 MMWR 42:597; and S. 
Handwerger et al., 1993, Clin Infect Dis 16:750). 

The use of antibiotics for therapeutics and prophylactic purposes, promotes the 
selection of resistant organisms and the spread of antibiotic resistance genes among bacteria. 
5 Previous studies have shown that virtually all staphylococci carry some antibiotic resistance 
genes on naturally occurring extrachromosomal mobile genetic elements, such as the 
plasmids. Survey and analysis of plasmids in clinical isolates of S. epidermidis have shown 
that more that 80% of isolates carry plasmids and in several cases more than one plasmid 
(Archer et al., 1982, Infect Immun, 35:627-632; Kloos et al., 1981, Can J Microbiol, 27:271- 

10 278; Moller, 1988, J Hosp Infect 12:19-27). Though the most important forms of resistance 
has been the inactivation of antibiotics, particularly penicillins and cephalosporins, recent 
clinical isolates have resistance to one or more of the following antibiotics, methicillin, 
tetracycline, erythromycin, gentamycin, kanamycin and chloramphenicol. In fact due to the 
wide spread occurrence of plasmids and their involvement in antibiotic resistance, plasmid 

1 5 profiling has been used as an epidemiological reagent to study nosocomial infections. This 
invention relates to isolated nucleic acids and polypeptides derived from S. epidermidis 
plasmids that are useful as molecular targets for diagnosis, prophylaxis and treatment of 
pathological conditions, as well as materials and methods for the diagnosis, prevention, and 
amelioration of pathological conditions resulting from bacterial infection. 

20 These concerns point to the need for diagnostic tools and therapeutics aimed at proper 

identification of strain and eradication of virulence. The design of vaccines that will limit the 
spread of infection and halt transfer of resistance factors is very desirable. 

Summary Of The Invention 
25 The present invention fulfills the need for diagnostic tools and therapeutics by 

providing bacterial-specific compositions and methods for detecting Staphylococcus species 
including S. epidermidis , as well as compositions and methods useful for treating and 
preventing Staphylococcus infection, in particular, S. epidermidis infection, in vertebrates 
including mammals. 
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The present invention encompasses isolated nucleic acids and polypeptides derived 
from S. epidermidis that are useful as reagents for diagnosis of bacterial disease, components 
of effective antibacterial vaccines, and/or as targets for antibacterial drugs including anti-5. 
epidermidis drugs. They can also be used to detect the presence of S. epidermidis and other 
5 Staphylococcus species in a sample; and in screening compounds for the ability to interfere 
with the S. epidermidis life cycle or to inhibit S. epidermidis infection. They also have use as 
biocontrol agents for plants. 

In one aspect, the invention features compositions of nucleic acids corresponding to 
entire coding sequences of S. epidermidis proteins (SEQ ID NO: 1 - SEQ ID NO: 3702) , 
10 including surface or secreted proteins or parts thereof, nucleic acids capable of binding mRNA 
from S. epidermidis proteins to block protein translation, and methods for producing S. 
epidermidis proteins or parts thereof using peptide synthesis and recombinant DNA 
techniques. This invention also features antibodies and nucleic acids useful as probes to 
detect S. epidermidis infection. In addition, vaccine compositions and methods for the 
1 5 protection or treatment of infection by S. epidermidis are within the scope of this invention. 

In another aspect, the invention relates to the nucliec acids corresponding to 2 
naturally occurring plasmids of S. epidermidis (SEQ ID NO: 3703 - SEQ ID NO: 3772) and 
the corresponding amino acid sequences (SEQ ID NO: 7475 - SEQ ID NO: 7544). 

The nucleotide sequences provided in SEQ ID NO: 1 - SEQ ID NO: 3772, a fragment 
20 thereof, or a nucleotide sequence at least about 99.5% identical to a sequence contained within 
SEQ ID NO: 1 - SEQ ID NO: 3772 may be "provided" in a variety of medias to facilitate use 
thereof. As used herein, "provided" refers to a manufacture, other than an isolated nucleic 
acid molecule, which contains a nucleotide sequence of the present invention, i.e., the 
nucleotide sequence provided in SEQ ID NO: 1 - SEQ ID NO: 3772, a fragment thereof, or a 
25 nucleotide sequence at least about 99.5% identical to a sequence contained within SEQ ID 
NO: 1 - SEQ ID NO: 3772. Uses for and methods for providing nucleotide sequences in a 
variety of media is well known in the art (see e.g., EPO Publication No. EP 0 756 006). 
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In one application of this embodiment, a nucleotide sequence of the present invention 
can be recorded on computer readable media. As used herein, "computer readable media" 
refers to any media which can be read and accessed directly by a computer. Such media 
include, but are not limited to: magnetic storage media, such as floppy discs, hard disc storage 
5 media, and magnetic tape; optical storage media such as CD-ROM; electrical storage media 
such as RAM and ROM; and hybrids of these categories such as magnetic/optical storage 
media. A person skilled in the art can readily appreciate how any of the presently known 
computer readable media can be used to create a manufacture comprising computer readable 
media having recorded thereon a nucleotide sequence of the present invention. 

10 As used herein, "recorded" refers to a process for storing information on computer 

readable media. A person skilled in the art can readily adopt any of the presently known 
methods for recording information on computer readable media to generate manufactures 
comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a person skilled in the art for 

1 5 creating a computer readable media having recorded thereon a nucleotide sequence of the 
present invention. The choice of the data storage structure will generally be based on the 
means chosen to access the stored information. In addition, a variety of data processor 
programs and formats can be used to store the nucleotide sequence information of the present 
invention on computer readable media. The sequence information can be represented in a 

20 word processing text file, formatted in commercially-available software such as WordPerfect 
and Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like. A person skilled in the art can readily 
adapt any number of data processor structuring formats (e.g. text file or database) in order to 
obtain computer readable media having recorded thereon the nucleotide sequence information 

25 of the present invention. 

By providing the nucleotide sequence of SEQ ID NO: 1 - SEQ ID NO: 3772, a 
fragment thereof, or a nucleotide sequence at least about 99.5% identical to SEQ ID NO: 1 - 
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SEQ ID NO: 3772 in computer readable form, a person skilled in the art can routinely access 
the coding sequence information for a variety of purposes. Computer software is publicly 
available which allows a person skilled in the art to access sequence information provided in a 
computer readable media. Examples of such computer software include programs of the 
5 "Staden Package 5 ', "DNA Star", "MacVector", GCG "Wisconsin Package" (Genetics 

Computer Group, Madison, WI) and "NCBI Toolbox" (National Center For Biotechnology 
Information). Suitable programs are described, for example, in Martin J. Bishop, ed., Guide 
to Human Genome Computing, 2d Edition, Academic Press, San Diego, CA. (1998); and 
Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: 

10 Tools for Genomic and Molecular Research, American Society for Microbiology, 
Washington, D.C. (1 997). 

Computer algorithms enable the identification of S. epidermidis open reading frames 
(ORFs) within SEQ ID NO: 1 - SEQ ID NO: 3772 which contain homology to ORFs or 
proteins from other organisms. Examples of such similarity-search algorithms include the 

15 BLAST [Altschul et al., J. Mol. Biol. 215:403-410 (1990)] and Smith- Waterman [Smith and 
Waterman (1981) Advances in Applied Mathematics, 2:482-489] search algorithms. Suitable 
search algorithms are described, for example, in Martin J. Bishop, ed., Guide to Human 
Genome Computing, 2d Edition, Academic Press, San Diego, CA. (1998); and Leonard F. 
Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: Tools for 

20 Genomic and Molecular Research, American Society for Microbiology, Washington, D.C. 
(1997). Such algorithms are utilized on computer systems as exemplified below. The ORFs 
so identified represent protein encoding fragments within the S. epidermidis genome and S. 
epidermidis plasmidsmd are useful in producing commercially important proteins such as 
enzymes used in fermentation reactions and in the production of commercially useful 

25 metabolites. 

The present invention further provides systems, particularly computer-based systems, 
which contain the sequence information described herein. Such systems are designed to 
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identify commercially important fragments of the S. epidermidis genome and plasmids. As 
used herein, "a computer-based system" refers to the hardware means, software means, and 
data storage means used to analyze the nucleotide sequence information of the present 
invention. The minimum hardware means of the computer-based systems of the present 
5 invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A person skilled in the art can readily appreciate that any one of the currently 
available computer-based systems is suitable for use in the present invention. The computer- 
based systems of the present invention comprise a data storage means having stored therein a 
nucleotide sequence of the present invention and the necessary hardware means and software 

10 means for supporting and implementing a search means. As used herein, "data storage means" 
refers to memory which can store nucleotide sequence information of the present invention, or 
a memory access means which can access manufactures having recorded thereon the 
nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are 

1 5 implemented on the computer-based system to compare a target sequence or target structural 
motif with the sequence information stored within the data storage means. Search means are 
used to identify fragments or regions of the S. epidermidis genome and plasmids which are 
similar to, or "match", a particular target sequence or target motif. A variety of known 
algorithms are known in the art and have been disclosed publicly, and a variety of 

20 commercially available software for conducting homology-based similarity searches are 
available and can be used in the computer-based systems of the present invention. Examples 
of such software includes, but is not limited to, FASTA (GCG Wisconsin Package), Bic_SW 
(Compugen Bioccelerator), BLASTN2, BLASTP2, BLASTX2 (NCBI) and Motifs (GCG). 
Suitable software programs are described, for example, in Martin J. Bishop, ed., Guide to 

25 Human Genome Computing, 2d Edition, Academic Press, San Diego, CA. (1998); and 
Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: 
Tools for Genomic and Molecular Research, American Society for Microbiology, 
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Washington, D.C. (1997). A person skilled in the art can readily recognize that any one of 
the available algorithms or implementing software packages for conducting homology 
searches can be adapted for use in the present computer-based systems. 

As used herein, a "target sequence" can be any DNA or amino acid sequence of six or 
5 more nucleotides or two or more amino acids. A person skilled in the art can readily recognize 
that the longer a target sequence is, the less likely a target sequence will be present as a 
random occurrence in the database. The most preferred sequence length of a target sequence is 
from about 10 to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is 
well recognized that many genes are longer than 500 amino acids, or 1 .5 kb in length, and that 

10 commercially important fragments of the S. epidermidis genome and plasmids from S. 

epidermidis, such as sequence fragments involved in gene expression and protein processing, 
will often be shorter than 30 nucleotides. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) are chosen based on 

1 5 a specific functional domain or three-dimensional configuration which is formed upon the 
folding of the target polypeptide. There are a variety of target motifs known in the art. Protein 
target motifs include, but are not limited to, enzymatic active sites, membrane-spanning 
regions, and signal sequences. Nucleic acid target motifs include, but are not limited to, 
promoter sequences, hairpin structures and inducible expression elements (protein binding 

20 sequences). 

A variety of structural formats for the input and output means can be used to input and 
output the information in the computer-based systems of the present invention. A preferred 
format for an output means ranks fragments of the S. epidermidis genome and plasmids 
possessing varying degrees of homology to the target sequence or target motif. Such 
25 presentation provides a person skilled in the art with a ranking of sequences which contain 
various amounts of the target sequence or target motif and identifies the degree of homology 
contained in the identified fragment. 
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A variety of comparing means can be used to compare a target sequence or target 
motif with the data storage means to identify sequence fragments of the S. epidermidis 
genome and plasmids. In the present examples, implementing software which implement the 
BLASTP2 and bic_SW algorithms (Altschul et al., J Mol. Biol. 215:403-410 (1990); 
5 Compugen Biocellerator) was used to identify open reading frames within the S. epidermidis 
genome and plasmids. A person skilled in the art can readily recognize that any one of the 
publicly available homology search programs can be used as the search means for the 
computer-based systems of the present invention. Suitable programs are described, for 
example, in Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, 

10 Academic Press, San Diego, CA. (1998); and Leonard F. Peruski, Jr., and Anne Harwood 
Peruski, The Internet and the New Biology : Tools for Genomic and Molecular Research, 
American Society for Microbiology, Washington, D.C. (1997). 

The invention features S. epidermidis polypeptides, preferably a substantially pure 
preparation of an S. epidermidis polypeptide, or a recombinant S. epidermidis polypeptide. 

15 In preferred embodiments: the polypeptide has biological activity; the polypeptide has an 
amino acid sequence at least about 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to an 
amino acid sequence of the invention contained in the Sequence Listing, preferably it has 
about 65% sequence identity with an amino acid sequence of the invention contained in the 
Sequence Listing, and most preferably it has about 92% to about 99% sequence identity with 

20 an amino acid sequence of the invention contained in the Sequence Listing; the polypeptide 
has an amino acid sequence essentially the same as an amino acid sequence of the invention 
contained in the Sequence Listing; the polypeptide is at least about 5, 10, 20, 50, 100, or 150 
amino acid residues in length; the polypeptide includes at least about 5, preferably at least 
about 10, more preferably at least about 20, still more preferably at least about 50, 100, or 150 

25 contiguous amino acid residues of the invention contained in the Sequence Listing. In yet 
another preferred embodiment, the amino acid sequence which differs in sequence identity by 
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about 7% to about 8% from the S. epidermidis amino acid sequences of the invention 
contained in the Sequence Listing is also encompassed by the invention. 

In preferred embodiments: the S. epidermidis polypeptide is encoded by a nucleic 
acid of the invention contained in the Sequence Listing, or by a nucleic acid having at least 
5 about 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid of the 
invention contained in the Sequence Listing. 

In a preferred embodiment, the subject S. epidermidis polypeptide differs in amino 
acid sequence at about 1, 2, 3, 5, 10 or more residues from a sequence of the invention 
contained in the Sequence Listing. The differences, however, are such that the S. epidermidis 
10 polypeptide exhibits an S. epidermidis biological activity, e.g., the S. epidermidis polypeptide 
retains a biological activity of a naturally occurring S. epidermidis enzyme. 

In preferred embodiments, the polypeptide includes all or a fragment of an amino acid 
sequence of the invention contained in the Sequence Listing; fused, in reading frame, to 
additional amino acid residues, preferably to residues encoded by genomic DNA 5 f or 3' to the 
1 5 genomic DNA which encodes a sequence of the invention contained in the Sequence Listing. 

In yet other preferred embodiments, the S. epidermidis polypeptide is a recombinant 
fusion protein having a first S. epidermidis polypeptide portion and a second polypeptide 
portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to S. 
epidermidis . The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a 
20 DNA binding domain, or a polymerase activating domain. In preferred embodiment the 
fusion protein can be used in a two-hybrid assay. 

Polypeptides of the invention include those which arise as a result of alternative 
transcription events, alternative RNA splicing events, and alternative translational and 
postranslational events. 

25 In a preferred embodiment, the encoded S. epidermidis polypeptide differs (e.g., by 

amino acid substitution, addition or deletion of at least one amino acid residue) in amino acid 
sequence at about 1, 2, 3, 5, 10 or more residues, from a sequence of the invention contained 

10 
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in the Sequence Listing. The differences, however, are such that: the S. epidermidis encoded 
polypeptide exhibits an S. epidermidis biological activity, e.g., the encoded S. epidermidis 
enzyme retains a biological activity of a naturally occurring S. epidermidis . 

In preferred embodiments, the encoded polypeptide includes all or a fragment of an 
5 amino acid sequence of the invention contained in the Sequence Listing; fused, in reading 
frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5' 
or 3' to the genomic DNA which encodes a sequence of the invention contained in the 
Sequence Listing. 

The S. epidermidis strain, 18972, from which genomic sequences have been 
10 sequenced, has been deposited on July 10, 1997, in the American Type Culture Collection and 
assigned the ATCC designation # 55998. 

Included in the invention are: allelic variations; natural mutants; induced mutants; 
proteins encoded by DNA that hybridize under high or low stringency conditions to a nucleic 
acid which encodes a polypeptide of the invention contained in the Sequence Listing (for 
1 5 definitions of high and low stringency see Current Protocols in Molecular Biology, John 
Wiley & Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by reference); and, 
polypeptides specifically bound by antisera to S. epidermidis polypeptides, especially by 
antisera to an active site or binding domain of S. epidermidis polypeptide. The invention also 
includes fragments, preferably biologically active fragments. These and other polypeptides 
20 are also referred to herein as S. epidermidis polypeptide analogs or variants. 

The invention further provides nucleic acids, e.g., RNA or DNA, encoding a 
polypeptide of the invention. This includes double stranded nucleic acids as well as coding 
and antisense single strands. 

In preferred embodiments, the subject S. epidermidis nucleic acid will include a 
25 transcriptional regulatory sequence, e.g., at least one of a transcriptional promoter or 

transcriptional enhancer sequence, operably linked to the S. epidermidis gene sequence, e.g., 
to render the S. epidermidis gene sequence suitable for expression in a recombinant host cell. 
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In yet a further preferred embodiment, the nucleic acid which encodes an 5. 
epidermidis polypeptide of the invention, hybridizes under stringent conditions to a nucleic 
acid probe corresponding to at least about 8 consecutive nucleotides of the invention 
contained in the Sequence Listing; more preferably to at least about 12 consecutive 
5 nucleotides of the invention contained in the Sequence Listing; still more preferably to at least 
about 20 consecutive nucleotides of the invention contained in the Sequence Listing; most 
preferably to at least about 40 consecutive nucleotides of the invention contained in the 
Sequence Listing. 

In another aspect, the invention provides a substantially pure nucleic acid having a 
10 nucleotide sequence which encodes an S. epidermidis polypeptide. In preferred 

embodiments: the encoded polypeptide has biological activity; the encoded polypeptide has 
an amino acid sequence at least about 60%, 70%, 80%, 90%, 95%, 98% or 99% homologous 
to an amino acid sequence of the invention contained in the Sequence Listing; the encoded 
polypeptide has an amino acid sequence essentially the same as an amino acid sequence of the 
15 invention contained in the Sequence Listing; the encoded polypeptide is at least about 5, 10, 
20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least about 5, 
preferably at least about 10, more preferably at least about 20, still more preferably at least 
about 50, 100, or 150 contiguous amino acids of the invention contained in the Sequence 
Listing. 

20 In another aspect, the invention encompasses: a vector including a nucleic acid which 

encodes an S. epidermidis polypeptide or an S. epidermidis polypeptide variant as described 
herein; a host cell transfected with the vector; and a method of producing a recombinant S. 
epidermidis polypeptide or S. epidermidis polypeptide variant; including culturing the cell, 
e.g., in a cell culture medium, and isolating an S. epidermidis or S. epidermidis polypeptide 

25 variant, e.g., from the cell or from the cell culture medium. 

One embodiment of the invention is directed to substantially isolated nucleic acids. 
Nucleic acids of the invention include sequences comprising at least about 8 nucleotides in 

12 
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length, more preferably at least about 12 nucleotides in length, even more preferably at least 
about 15-20 nucleotides in length, that correspond to a subsequence of any one of SEQ ID 
NO: 1 - SEQ ID NO: 3772 or complements thereof. Alternatively, the nucleic acids comprise 
sequences contained within any ORF (open reading frame), including a complete protein- 
5 coding sequence, of which any of SEQ ID NO: 1 - SEQ ID NO: 3772 forms a part. The 
invention encompasses sequence-conservative variants and function-conservative variants of 
these sequences. The nucleic acids may be DNA, RNA, DNA/RNA duplexes, protein- 
nucleic acid (PNA), or derivatives thereof. 

In another aspect, the invention features a purified recombinant nucleic acid having at 

10 least about 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a sequence of the 
invention contained in the Sequence Listing 

The invention also encompasses recombinant DNA (including DNA cloning and 
expression vectors) comprising these S. epidermidis -derived sequences; host cells 
comprising such DNA, including fungal, bacterial, yeast, plant, insect, and mammalian host 

1 5 cells; and methods for producing expression products comprising RNA and polypeptides 

encoded by the S. epidermidis sequences. These methods are carried out by incubating a host 
cell comprising an S. epidermidis -derived nucleic acid sequence under conditions in which 
the sequence is expressed. The host cell may be native or recombinant. The polypeptides can 
be obtained by (a) harvesting the incubated cells to produce a cell fraction and a medium 

20 fraction; and (b) recovering the £ epidermidis polypeptide from the cell fraction, the medium 
fraction, or both. The polypeptides can also be made by in vitro translation. 

In another aspect, the invention features nucleic acids capable of binding mRNA of S. 
epidermidis . Such nucleic acid is capable of acting as antisense nucleic acid to control the 
translation of mRNA of S. epidermidis . A further aspect features a nucleic acid which is 

25 capable of binding specifically to an S. epidermidis nucleic acid. These nucleic acids are also 
referred to herein as complements and have utility as probes and as capture reagents. 



13 



Attorney Docket: PATH03-16 



In another aspect, the invention features an expression system comprising an open 
reading frame corresponding to S. epidermidis nucleic acid. The nucleic acid further 
comprises a control sequence compatible with an intended host. The expression system is 
useful for making polypeptides corresponding to S. epidermidis nucleic acid. 
5 In another aspect, the invention encompasses: a vector including a nucleic acid which 

encodes an & epidermidis polypeptide or an S. epidermidis polypeptide variant as described 
herein; a host cell transfected with the vector; and a method of producing a recombinant S. 
epidermidis polypeptide or S. epidermidis polypeptide variant; including culturing the cell, 
e.g., in a cell culture medium, and isolating the S. epidermidis or & epidermidis polypeptide 

10 variant, e.g., from the cell or from the cell culture medium. 

In yet another embodiment of the invention encompasses reagents for detecting 
bacterial infection, including S. epidermidis infection, which comprise at least one S. 
epidermidis -derived nucleic acid defined by any one of SEQ ID NO: 1 - SEQ ID NO: 3772, 
or sequence-conservative or function-conservative variants thereof. Alternatively, the 

15 diagnostic reagents comprise nucleotide sequences that are contained within any open reading 
frames (ORFs), including preferably complete protein-coding sequences, contained within any 
of SEQ ID NO: 1 - SEQ ID NO: 3772, or polypeptide sequences contained within any of SEQ 
ID NO: 3773 - SEQ ID NO: 7544, or polypeptides of which any of the above sequences forms 
a part, or antibodies directed against any of the above peptide sequences or function- 

20 conservative variants and/or fragments thereof. 

The invention further provides antibodies, preferably monoclonal antibodies, which 
specifically bind to the polypeptides of the invention. Methods are also provided for 
producing antibodies in a host animal. The methods of the invention comprise immunizing an 
animal with at least one S. epidermidis -derived immunogenic component, wherein the 

25 immunogenic component comprises one or more of the polypeptides encoded by any one of 
SEQ ID NO: 1 - SEQ ID NO: 3772 or sequence-conservative or function-conservative 
variants thereof; or polypeptides that are contained within any ORFs, including complete 
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protein-coding sequences, of which any of SEQ ID NO: 1 - SEQ ID NO: 3772 forms a part; or 
polypeptide sequences contained within any of SEQ ID NO: 3773 - SEQ ID NO: 7544; or 
polypeptides of which any of SEQ ID NO: 3773 - SEQ ID NO: 7544 forms a part. Host 
animals include any warm blooded animal, including without limitation mammals and birds. 
5 Such antibodies have utility as reagents for immunoassays to evaluate the abundance and 
distribution of S. epidermidis -specific antigens. 

In yet another aspect, the invention provides diagnostic methods for detecting S. 
epidermidis antigenic components or anti-5. epidermidis antibodies in a sample. S. 
epidermidis antigenic components may be detected by known processes, including but not 

10 limited to detection by a process comprising: (i) contacting a sample suspected to contain a 
bacterial antigenic component with a bacterial-specific antibody, under conditions in which a 
stable antigen-antibody complex can form between the antibody and bacterial antigenic 
components in the sample; and (ii) detecting any antigen-antibody complex formed in step (i), 
wherein detection of an antigen-antibody complex indicates the presence of at least one 

15 bacterial antigenic component in the sample. In different embodiments of this method, the 
antibodies used are directed against a sequence encoded by any of SEQ ID NO: 1 - SEQ ID 
NO: 3772 or sequence-conservative or function-conservative variants thereof, or against a 
polypeptide sequence contained in any of SEQ ID NO: 3773 - SEQ ID NO: 7544 or function- 
conservative variants thereof. 

20 In yet another aspect, the invention provides a method for detecting antibacterial- 

specific antibodies in a sample, which comprises: (i) contacting a sample suspected to contain 
antibacterial-specific antibodies with an & epidermidis antigenic component, under 
conditions in which a stable antigen-antibody complex can form between the S. epidermidis 
antigenic component and antibacterial antibodies in the sample; and (ii) detecting any antigen- 

25 antibody complex formed in step (i), wherein detection of an antigen-antibody complex 

indicates the presence of antibacterial antibodies in the sample. In different embodiments of 
this method, the antigenic component is encoded by a sequence contained in any of SEQ ID 
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NO: 1 - SEQ ID NO: 3772 or sequence-conservative and function-conservative variants 
thereof, or is a polypeptide sequence contained in any of SEQ ID NO: 3773 - SEQ ID NO: 
7544 or function-conservative variants thereof 

In another aspect, the invention features a method of generating vaccines for 
5 immunizing an individual against S. epidermidis . The method includes: immunizing a 
subject with an S. epidermidis polypeptide, e.g., a surface or secreted polypeptide, or a 
combination of such peptides or active portion(s) thereof, and a pharmaceutically acceptable 
carrier. Such vaccines have therapeutic and prophylactic utilities. 

In another aspect, the invention features a method of evaluating a compound, e.g., a 

10 polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an S. 

epidermidis polypeptide. The method includes contacting the compound to be evaluated with 
an S. epidermidis polypeptide and determining if the compound binds or otherwise interacts 
with the S. epidermidis polypeptide. Compounds which bind or otherwise interact with S. 
epidermidis polypeptides are candidates as modulators, including activators and inhibitors, of 

15 the bacterial life cycle. These assays can be performed in vitro or in vivo. 

In another aspect, the invention features a method of evaluating a compound, e.g., a 
polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an S, 
epidermidis nucleic acid, e.g., DNA or RNA. The method includes contacting the compound 
to be evaluated with an S. epidermidis nucleic acid and determining if the compound binds or 

20 otherwise interacts with the S. epidermidis nucleic acid. Compounds which bind S. 

epidermidis are candidates as modultors, including activators and inhibitors, of the bacterial 
life cycle. These assays can be performed in vitro or in vivo. 

A particularly preferred embodiment of the invention is directed to a method of 
screening test compounds for anti-bacterial activity, which method comprises: selecting as a 

25 target a bacterial specific sequence, which sequence is essential to the viability of a bacterial 
species; contacting a test compound with said target sequence; and selecting those test 
compounds which bind to said target sequence as potential anti-bacterial candidates. In one 
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embodiment, the target sequence selected is specific to a single species, or even a single 
strain, such as, for example, the strain S. epidermidis 18972. In a second embodiment, the 
target sequence is common to at least two species of bacteria. In a third embodiment, the 
target sequence is common to a family of bacteria. The target sequence may be a nucleic acid 
5 sequence or a polypeptide sequence. Methods employing sequences common to more than 
one species of microorganism may be used to screen candidates for broad spectrum anti- 
bacterial activity. 

The invention also provides methods for preventing or treating disease caused by 
certain bacteria, including S. epidermidis , which are carried out by administering to an 
10 animal in need of such treatment, in particular a warm-blooded vertebrate, including but not 
limited to birds and mammals, a compound that specifically inhibits or interferes with the 
function of a bacterial polypeptide or nucleic acid. In a particularly preferred embodiment, the 
mammal to be treated is human. 

15 DETAILED DESCRIPTION OF THE INVENTION 

The sequences of the present invention include the specific nucleic acid and amino 
acid sequences set forth in the Sequence Listing that forms a part of the present specification, 
and which are designated SEQ ID NO: 1 - SEQ ID NO: 7544. Use of the terms "SEQ ID NO: 
1 - SEQ ID NO: 3772 ", " SEQ ID NO: 3773 - SEQ ID NO: 7544, "the sequences depicted in 

20 Table 2", etc., is intended, for convenience, to refer to each individual SEQ ID NO 

individually, and is not intended to refer to the genus of these sequences unless such reference 
would be indicated. In other words, it is a shorthand for listing all of these sequences 
individually. The invention encompasses each sequence individually, as well as any 
combination thereof. 

25 
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Definitions 

"Nucleic acid" or "polynucleotide" as used herein refers to purine- and pyrimidine- 
containing polymers of any length, either polyribonucleotides or polydeoxyribonucleotides or 
mixed polyribo-polydeoxyribo nucleotides. This includes single- and double-stranded 
5 molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA hybrids, as well as "protein nucleic 
acids" (PNA) formed by conjugating bases to an amino acid backbone. This also includes 
nucleic acids containing modified bases. 

A nucleic acid or polypeptide sequence that is "derived from" a designated sequence 
refers to a sequence that corresponds to a region of the designated sequence. For nucleic acid 

10 sequences, this encompasses sequences that are homologous or complementary to the 

sequence, as well as "sequence-conservative variants" and "function-conservative variants." 
For polypeptide sequences, this encompasses "function-conservative variants." Sequence- 
conservative variants are those in which a change of one or more nucleotides in a given codon 
position results in no alteration in the amino acid encoded at that position. Function- 

1 5 conservative variants are those in which a given amino acid residue in a polypeptide has been 
changed without altering the overall conformation and function of the native polypeptide, 
including, but not limited to, replacement of an amino acid with one having similar physico- 
chemical properties (such as, for example, acidic, basic, hydrophobic, and the like). 
"Function-conservative" variants also include any polypeptides that have the ability to elicit 

20 antibodies specific to a designated polypeptide. 

An n S. epidermidis -derived" nucleic acid or polypeptide sequence may or may not be 
present in other bacterial species, and may or may not be present in all S. epidermidis strains. 
This term is intended to refer to the source from which the sequence was originally isolated. 
Thus, an S. epidermidis -derived polypeptide, as used herein, may be used, e.g., as a target to 

25 screen for a broad spectrum antibacterial agent, to search for homologous proteins in other 
species of bacteria or in eukaryotic organisms such as bacteria humans, etc. 
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A purified or isolated polypeptide or a substantially pure preparation of a polypeptide 
are used interchangeably herein and, as used herein, mean a polypeptide that has been 
separated from other proteins, lipids, and nucleic acids with which it naturally occurs. 
Preferably, the polypeptide is also separated from substances, e.g., antibodies or gel matrix, 
5 e.g., polyacrylamide, which are used to purify it. Preferably, the polypeptide constitutes at 
least about 10, 20, 50 70, 80 or 95% dry weight of the purified preparation. Preferably, the 
preparation contains sufficient polypeptide to allow protein sequencing; at least about 1, 10, or 
preferably 100 mg of polypeptide. 

A purified preparation of cells refers to, in the case of plant or animal cells, an in vitro 

10 preparation of cells and not an entire intact plant or animal. In the case of cultured cells or 
microbial cells, it consists of a preparation of at least about 10%, more preferably at least 
about 50%, of the subject cells. 

A purified or isolated or a substantially pure nucleic acid, e.g., a substantially pure 
DNA, (are terms used interchangeably herein) is a nucleic acid which is one or both of the 

15 following: not immediately contiguous with both of the coding sequences with which it is 
immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the naturally-occurring 
genome and plasmids of the organism from which the nucleic acid is derived; or which is 
substantially free of a nucleic acid with which it occurs in the organism from which the 
nucleic acid is derived. The term includes, for example, a recombinant DNA which is 

20 incorporated into a vector, e.g., into an autonomously replicating plasmid or virus, or into the 
genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (e.g., a 
cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease treatment) 
independent of other DNA sequences. Substantially pure DNA also includes a recombinant 
DNA which is part of a hybrid gene encoding additional S. epidermidis DNA sequence. 

25 A "contig" as used herein is a nucleic acid representing a continuous stretch of 

genomic sequence of an organism. 
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An "open reading frame", also referred to herein as ORP, is a region of nucleic acid 
which encodes a polypeptide. This region may represent a portion of a coding sequence or a 
total sequence and can be determined from a stop to stop codon or from a start to stop codon. 

As used herein, a "coding sequence" is a nucleic acid which is transcribed into 
5 messenger RNA and/or translated into a polypeptide when placed under the control of 

appropriate regulatory sequences. The boundaries of the coding sequence are determined by a 
translation start codon at the five prime terminus and a translation stop code at the three prime 
terminus. A coding sequence can include but is not limited to messenger RNA, synthetic 
DNA, and recombinant nucleic acid sequences. 
10 A "complement" of a nucleic acid as used herein refers to an anti-parallel or antisense 

sequence that participates in Watson-Crick base-pairing with the original sequence. 

A "gene product" is a protein or structural RNA which is specifically encoded by a 

gene. 

As used herein, the term "probe" refers to a nucleic acid, peptide or other chemical 
1 5 entity which specifically binds to a molecule of interest. Probes are often associated with or 
capable of associating with a label. A label is a chemical moiety capable of detection. Typical 
labels comprise dyes, radioisotopes, luminescent and chemiluminescent moieties, 
fluorophores, enzymes, precipitating agents, amplification sequences, and the like. Similarly, 
a nucleic acid, peptide or other chemical entity which specifically binds to a molecule of 
20 interest and immobilizes such molecule is referred herein as a "capture ligand". Capture 
ligands are typically associated with or capable of associating with a support such as nitro- 
cellulose, glass, nylon membranes, beads, particles and the like. The specificity of 
hybridization is dependent on conditions such as the base pair composition of the nucleotides, 
and the temperature and salt concentration of the reaction. These conditions are readily 
25 discernable to one of ordinary skill in the art using routine experimentation. 

"Homologous" refers to the sequence similarity or sequence identity between two 
polypeptides or between two nucleic acid molecules. When a position in both of the two 
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compared sequences is occupied by the same base or amino acid monomer subunit, e.g., if a 
position in each of two DNA molecules is occupied by adenine, then the molecules are 
homologous at that position. The percent of homology between two sequences is a function 
of the number of matching or homologous positions shared by the two sequences divided by 
5 the number of positions compared x 100. For example, if 6 of 10 of the positions in two 
sequences are matched or homologous then the two sequences are 60% homologous. By way 
of example, the DNA sequences ATTGCC and TATGGC share 50% homology. Generally, a 
comparison is made when two sequences are aligned to give maximum homology. 

Nucleic acids are hybridizable to each other when at least one strand of a nucleic acid 

1 0 can anneal to the other nucleic acid under defined stringency conditions. Stringency of 

hybridization is determined by: (a) the temperature at which hybridization and/or washing is 
performed; and (b) the ionic strength and polarity of the hybridization and washing solutions. 
Hybridization requires that the two nucleic acids contain complementary sequences; 
depending on the stringency of hybridization, however, mismatches may be tolerated. 

15 Typically, hybridization of two sequences at high stringency (such as, for example, in a 
solution of 0.5X SSC, at 65° C) requires that the sequences be essentially completely 
homologous. Conditions of intermediate stringency (such as, for example, 2X SSC at 65 0 C) 
and low stringency (such as, for example 2X SSC at 55° C) require correspondingly less 
overall complementarity between the hybridizing sequences. (IX SSC is 0.15 M NaCl, 0.015 

20 MNa citrate). 

The terms peptides, proteins, and polypeptides are used interchangeably herein. 

As used herein, the term "surface protein" refers to all surface accessible proteins, e.g. 
inner and outer membrane proteins, proteins adhering to the cell wall, and secreted proteins. 

A polypeptide has S. epidermidis biological activity if it has one, two or preferably 
25 more of the following properties: (1) if when expressed in the course of an S. epidermidis 
infection, it can promote, or mediate the attachment of S. epidermidis to a cell; (2) it has an 
enzymatic activity, structural or regulatory function characteristic of an S. epidermidis 
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protein; (3) the gene which encodes it can rescue a lethal mutation in an S. epidermidis gene. 
A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a 
polypeptide having one of the above-listed properties. 

A biologically active fragment or analog is one having an in vivo or in vitro activity 
5 which is characteristic of the S. epidermidis polypeptides of the invention contained in the 
Sequence Listing, or of other naturally occurring S. epidermidis polypeptides, e.g., one or 
more of the biological activities described herein. Especially preferred are fragments which 
exist in vivo, e.g., fragments which arise from post transcriptional processing or which arise 
from translation of alternatively spliced RNA's. Fragments include those expressed in native 

10 or endogenous cells as well as those made in expression systems, e.g., in CHO (Chinese 
Hamster Ovary) cells. Because peptides such as S. epidermidis polypeptides often exhibit a 
range of physiological properties and because such properties may be attributable to different 
portions of the molecule, a useful S. epidermidis fragment or S. epidermidis analog is one 
which exhibits a biological activity in any biological assay for S. epidermidis activity. The 

15 fragment or analog possesses about 10%, preferably about 40%, more preferably about 60%, 
70%, 80% or 90% or greater of the activity of S. epidermidis , in any in vivo or in vitro assay. 

Analogs can differ from naturally occurring S. epidermidis polypeptides in amino acid 
sequence or in ways that do not involve sequence, or both. Non-sequence modifications 
include changes in acetylation, methylation, phosphorylation, carboxylation, or glycosylation. 

20 Preferred analogs include S. epidermidis polypeptides (or biologically active fragments 
thereof) whose sequences differ from the wild-type sequence by one or more conservative 
amino acid substitutions or by one or more non-conservative amino acid substitutions, 
deletions, or insertions which do not substantially diminish the biological activity of the S. 
epidermidis polypeptide. Conservative substitutions typically include the substitution of one 

25 amino acid for another with similar characteristics, e.g., substitutions within the following 
groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic 
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acid; asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. 
Other conservative substitutions can be made in view of the table below. 
TABLE 1 



CONSERVATIVE AMINO AC 


ID REPLACEMENTS 


For Amino Acid 


Code 


Replace with any of 


Alanine 


A 


D-Ala, Gly, beta-Ala, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, He, 
D-Met, D-Ile, Orn, D-Orn 


Asparagine 


N 


D-Asn, Asp, D-Asp, Glu, D-Glu, Gin, D-Gln 


Aspartic Acid 


D 


D-Asp, D-Asn, Asn, Glu, D-Glu, Gin, D-Gln 


Cysteine 


C 


D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr 


Glutamine 


Q 


D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp 


Glutamic Acid 


E 


T"\ T\ A — A A TV A 1 T\ 1 

D-Glu, D-Asp, Asp, Asn, D-Asn, Gin, D-Gln 


Glycine 


G 


Ala, D-Ala, Pro, D-Pro, p-Ala, Acp 


Isoleucine 


I 


D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met 


Leucine 


L 


D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met 


T vcmp 


v- 

-TV 


D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D- 

lV/f^t Tl<=> O Tip Orn Fi Orn 
1V1CL, 11C, iy-llC, L/lIl, U-Kjlll 


Methionine 


M 


D-Met, S-Me-Cys, He, D-Ile, Leu, D-Leu, Val, D-Val 


Phenylalanine 


F 


D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, 
Trans-3,4, or 5-phenylproline, cis-3,4, or 5- 
phenylproline 


Proline 


P 


D-Pro, L-I-thioazolidine-4-carboxylic acid, D-or L-l- 
oxazolidine-4-carboxylic acid 


Serine 


S 


D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), 
D-Met(O), L-Cys, D-Cys 


Threonine 


T 


D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), 
D-Met(O), Val, D-Val 1 


Tyrosine 


Y 


D-Tyr, Phe, D-Phe, L-Dopa, His, D-His 


Valine 


V 


D-Val, Leu, D-Leu, He, D-Ile, Met, D-Met 
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Other analogs within the invention are those with modifications which increase 
peptide stability; such analogs may contain, for example, one or more non-peptide bonds 
(which replace the peptide bonds) in the peptide sequence. Also included are: analogs that 
5 include residues other than naturally occurring L-amino acids, e.g., D-amino acids or non- 
naturally occurring or synthetic amino acids, e.g., P or y amino acids; and cyclic analogs. 

As used herein, the term "fragment", as applied to an S. epidermidis analog, will 
ordinarily be at least about 20 residues, more typically at least about 40 residues, preferably at 
least about 60 residues in length. Fragments of S. epidermidis polypeptides can be generated 
10 by methods known to those skilled in the art. The ability of an Staphylococcus fragment to 
exhibit a biological activity of S. epidermidis polypeptide can be assessed by methods known 
to those skilled in the art as described herein. Also included are S. epidermidis polypeptides 
containing residues that are not required for biological activity of the peptide or that result 
from alternative mRNA splicing or alternative protein processing events. 
15 An "immunogenic component" as used herein is a moiety, such as an S. epidermidis 

polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or cellular 
immune response in a host animal. 

An "antigenic component" as used herein is a moiety, such as an S. epidermidis 
polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody with 
20 sufficiently high affinity to form a detectable antigen-antibody complex. 

The term "antibody" as used herein is intended to include fragments thereof which are 
specifically reactive with S. epidermidis polypeptides. 

As used herein, the term "cell-specific promoter" means a DNA sequence that serves 
as a promoter, i.e., regulates expression of a selected DNA sequence operably linked to the 
25 promoter, and which effects expression of the selected DNA sequence in specific cells of a 
tissue. The term also covers so-called "leaky" promoters, which regulate expression of a 
selected DNA primarily in one tissue, but cause expression in other tissues as well. 
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Misexpression, as used herein, refers to a non-wild type pattern of gene expression. It 
includes: expression at non-wild type levels, i.e., over or under expression; a pattern of 
expression that differs from wild type in terms of the time or stage at which the gene is 
expressed, e.g., increased or decreased expression (as compared with wild type) at a 
5 predetermined developmental period or stage; a pattern of expression that differs from wild 
type in terms of increased expression (as compared with wild type) in a predetermined cell 
type or tissue type; a pattern of expression that differs from wild type in terms of the splicing 
size, amino acid sequence, post-translational modification, or biological activity of the 
expressed polypeptide; a pattern of expression that differs from wild type in terms of the effect 

10 of an environmental stimulus or extracellular stimulus on expression of the gene, e.g., a 

pattern of increased or decreased expression (as compared with wild type) in the presence of 
an increase or decrease in the strength of the stimulus. 

As used herein, "host cells" and other such terms denoting microorganisms or higher 
eukaryotic cell lines cultured as unicellular entities refers to cells which can become or have 

1 5 been used as recipients for a recombinant vector or other transfer DNA, and include the 

progeny of the original cell which has been transfected. It is understood by individuals skilled 
in the art that the progeny of a single parental cell may not necessarily be completely identical 
in genomic or total DNA compliment to the original parent, due to accident or deliberate 
mutation. 

20 As used herein, the term "control sequence" refers to a nucleic acid having a base 

sequence which is recognized by the host organism to effect the expression of encoded 
sequences to which they are ligated. The nature of such control sequences differs depending 
upon the host organism; in prokaryotes, such control sequences generally include a promoter, 
ribosomal binding site, terminators, and in some cases operators; in eukaryotes, generally such 

25 control sequences include promoters, terminators and in some instances, enhancers. The term 
control sequence is intended to include at a minimum, all components whose presence is 
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necessary for expression, and may also include additional components whose presence is 

advantageous, for example, leader sequences. 

As used herein, the term "operably linked" refers to sequences joined or ligated to 

function in their intended manner. For example, a control sequence is operably linked to 
5 coding sequence by ligation in such a way that expression of the coding sequence is achieved 

under conditions compatible with the control sequence and host cell. 

The "metabolism" of a substance, as used herein, means any aspect of the expression, 

function, action, or regulation of the substance. The metabolism of a substance includes 

modifications, e.g., covalent or non-covalent modifications of the substance. The metabolism 
10 of a substance includes modifications, e.g., covalent or non-covalent modification, the 

substance induces in other substances. The metabolism of a substance also includes changes 

in the distribution of the substance. The metabolism of a substance includes changes the 

substance induces in the distribution of other substances. 

A "sample" as used herein refers to a biological sample, such as, for example, tissue or 
1 5 fluid isloated from an individual (including without limitation plasma, serum, cerebrospinal 

fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture constituents, as well 

as samples from the environment. 

Technical and scientific terms used herein have the meanings commonly understood 

by one of ordinary skill in the art to which the present invention pertains, unless otherwise 
20 defined. Reference is made herein to various methodologies known to those of skill in the art. 

Publications and other materials setting forth such known methodologies to which reference is 

made are incorporated herein by reference in their entireties as though set forth in full. The 

practice of the invention will employ, unless otherwise indicated, conventional techniques of 

chemistry, molecular biology, microbiology, recombinant DNA, and immunology, which are 
25 within the skill of the art. Such techniques are explained fully in the literature. See e.g., 

Sambrook, Fritsch, and Maniatis, Molecular Cloning; Laboratory Manual 2nd ed. (1989); 

DNA Cloning, Volumes I and II (D.N Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 



Attorney Docket: PATH03-16 



ed, 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); the series, 
Methods in Enzymoloqy (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and 
Grossman, eds.); PCR-A Practical Approach (McPherson, Quirke, and Taylor, eds., 1991); 
Immunology, 2d Edition, 1989, Roitt et aL, C.V. Mosby Company, and New York; Advanced 
5 Immunology, 2d Edition, 1991, Male et aL, Grower Medical Publishing, New York.; DNA 
Cloning: A Practical Approach, Volumes I and II, 1985 (D.N. Glover ed.); Oligonucleotide 
Synthesis, 1984, (M.L. Gait ed); Transcription and Translation, 1984 (Hames and Higgins 
eds.); Animal Cell Culture, 1986 (R.I. Freshney ed.); Immobilized Cells and Enzymes, 1986 
(IRL Press); Perbal, 1984, A Practical Guide to Molecular Cloning; Gene Transfer Vectors 

10 for Mammalian Cells, 1987 (J. H. Miller and M. P. Calos eds., Cold Spring Harbor 
Laboratory); Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, 
Academic Press, San Diego, CA. (1998); and Leonard F. Peruski, Jr., and Anne Harwood 
Peruski, The Internet and the New Biology: Tools for Genomic and Molecular Research, 
American Society for Microbiology, Washington, D.C. (1997). 

1 5 Any suitable materials and/or methods known to those of skill can be utilized in 

carrying out the present invention; however, preferred materials and/or methods are described. 
Materials, reagents and the like to which reference is made in the following description and 
examples are obtainable from commercial sources, unless otherwise noted. 

20 S. epidermidis Genomic Sequence 

This invention provides nucleotide sequences of the genome of & epidermidis which 
thus comprises a DNA sequence library of S. epidermidis genomic DNA. The detailed 
description that follows provides nucleotide sequences of S. epidermidis , and also describes 
how the sequences were obtained and how ORFs and protein-coding sequences were 

25 identified. Also described are compositions and methods of using the disclosed S. 
epidermidis sequences in methods including diagnostic and therapeutic applications. 
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Furthermore, the library can be used as a database for identification and comparison of 
medically important sequences in this and other strains of S. epidermidis . 

To determine the genomic sequence of S, epidermidis, DNA from strain 1 8972 of S. 
epidermidis was isolated and a library of DNA fragments were transformed into DH5a cells. 
5 DNA sequencing was achieved using established ABI sequencing methods on ABI377 
automated DNA sequencers. The cloning and sequencing procedures are described in more 
detail in the Exemplification. 

Individual sequence reads were assembled using PHRAP (P. Green, Abstracts of DOE 
Human Genome Program Contractor-Grantee Workshop V, Jan. 1996, p. 157). The average 
10 contig length was about 3-4 kb. 

All subsequent steps were based on sequencing by ABI377 automated DNA 
sequencing methods. The cloning and sequencing procedures are described in more detail in 
the Exemplification. 

A variety of approaches may be used to order the contigs so as to obtain a continuous 
15 sequence representing the entire S. epidermidis genome. Synthetic oligonucleotides are 
designed that are complementary to sequences at the end of each contig. These 
oligonucleotides may be hybridized to libaries of S. epidermidis genomic DNA in, for 
example, lambda phage vectors or plasmid vectors to identify clones that contain sequences 
corresponding to the junctional regions between individual contigs. Such clones are then used 
20 to isolate template DNA and the same oligonucleotides are used as primers in polymerase 
chain reaction (PCR) to amplify junctional fragments, the nucleotide sequence of which is 
then determined. 

The S. epidermidis sequences were analyzed for the presence of open reading frames 
(ORFs) comprising at least 180 nucleotides. As a result of the analysis of ORFs based on 
25 stop-to-stop codon reads, it should be understood that these ORFs may not correspond to the 
ORF of a naturally-occurring S. epidermidis polypeptide. These ORFs may contain start 
codons which indicate the initiation of protein synthesis of a naturally-occurring S. 
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epidermidis polypeptide. Such start codons within the ORFs provided herein were identified 
by those of ordinary skill in the relevant art, and the resulting ORF and the encoded S. 
epidermidis polypeptide is within the scope of this invention. For example, within the ORFs 
a codon such as AUG or GUG (encoding methionine or valine) which is part of the initiation 
5 signal for protein synthesis were identified and the portion of an ORF to corresponding to a 
naturally-occurring & epidermidis polypeptide was recognized. The predicted coding regions 
were defined by evaluating the coding potential of such sequences with the program 
GENEMARK™ (Borodovsky and Mclninch, 1993, Comp. . 17:123). 

Each predicted ORF amino acid sequence was compared with all sequences found in 

10 current GENBANK, SWISS-PROT, and PIR databases using the BLAST algorithm. BLAST 
identifies local alignments occurring by chance between the ORF sequence and the sequence 
in the databank (Altschal et al., 1990, L Mol. Biol. 215:403-410). Homologous ORFs 
(probabilities less than 10" 5 by chance) andORF's that are probably non-homologous 
(probabilities greater than 10" 5 by chance) but have good codon usage were identified. Both 

15 homologous, sequences and non-homologous sequences with good codon usage, are likely to 
encode proteins and are encompassed by the invention. 

S. epidermidis Plasmid Sequences 

This invention also provides nucleotide sequences of two naturally occurs plasmids 

20 which thus comprises a DNA sequence library of S. epidermidis plasmid DNA. One plasmid 
disclosed is approximately 39 Kb and nucleic acid sequence is contained within SEQ ID NO: 
3703 - SEQ ID NO: 3764. The other plasmid is approximately 2.9 Kb contained within SEQ 
ID NO: 3765 - SEQ ID NO: 3772. The detailed description that follows provides nucleotide 
sequences of S. epidermidis, and also describes how the sequences were obtained and how 

25 ORFs and protein-coding sequences were identified. Also described are compositions and 
methods of using the disclosed £ epidermidis sequences in methods including diagnostic and 
therapeutic applications. The plasmid sequences can also be used as vectors and gene 
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expression. Furthermore, the plasmid library can be used as a database for identification and 
comparison of medically important sequences in this and other strains of S. epidermidis. 

Similar methods were used to determine to plasmid sequences of S. epidermidis as 
described above in determining the genomic sequence. A more detailed description of the 
5 methods are in the Exemplification. 

S. epidermidis Nucleic Acids 

The present invention provides a library of S. epidermidis -derived nucleic acid 
sequences. The libraries provide probes, primers, and markers which are used as markers in 

10 epidemiological studies. The present invention also provides a library of S. epidermidis - 
derived nucleic acid sequences which comprise or encode targets for therapeutic drugs. 

The nucleic acids of this invention may be obtained directly from the DNA of the 
above referenced S. epidermidis strain by using the polymerase chain reaction (PCR). See 
"PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford, UK, 

15 1 99 1 ) for details about the PCR. High fidelity PCRis used to ensure a faithful DNA copy 
prior to expression. In addition, the authenticity of amplified products is verified by 
conventional sequencing methods. Clones carrying the desired sequences described in this 
invention may also be obtained by screening the libraries by means of the PCR or by 
hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or 

20 plaques as known in the art (see, e.g., Sambrook et al., Molecular Cloning, A Laboratory 
Manual 2nd edition, 1989, Cold Spring Harbor Press, NY). 

It is also possible to obtain nucleic acids encoding S. epidermidis polypeptides from a 
cDNA library in accordance with protocols herein described. A cDNA encoding an S. 
epidermidis polypeptide can be obtained by isolating total mRNA from an appropriate strain. 

25 Double stranded cDNAs can then be prepared from the total mRNA. Subsequently, the 

cDNAs can be inserted into a suitable plasmid or viral (e.g., bacteriophage) vector using any 
one of a number of known techniques. Genes encoding S. epidermidis polypeptides can also 
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be cloned using established polymerase chain reaction techniques in accordance with the 
nucleotide sequence information provided by the invention. The nucleic acids of the 
invention can be DNA or RNA. Preferred nucleic acids of the invention are contained in the 
Sequence Listing. 

5 The nucleic acids of the invention can also be chemically synthesized using standard 

techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, 
including solid-phase synthesis which, like peptide synthesis, has been fully automated in 
commercially available DNA synthesizers (See e.g., Itakura et al. U.S. Patent No. 4,598,049; 
Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos. 4,401,796 and 

10 4,373,071, incorporated by reference herein). 

In another example, DNA can be chemically synthesized using, e.g., the 
phosphoramidite solid support method of Matteucci et al, 1981, J. Am, Chem. Soc. 103:3185, 
the method of Yoo et al, 1989, 1 Biol Chem. 764:17078, or other well known methods. This 
can be done by sequentially linking a series of oligonucleotide cassettes comprising pairs of 

1 5 synthetic oligonucleotides, as described below. 

Nucleic acids isolated or synthesized in accordance with features of the present 
invention are useful, by way of example, without limitation, as probes, primers, capture 
ligands, antisense genes and for developing expression systems for the synthesis of proteins 
and peptides corresponding to such sequences. As probes, primers, capture ligands and 

20 antisense agents, the nucleic acid normally consists of all or part (approximately twenty or 
more nucleotides for specificity as well as the ability to form stable hybridization products) of 
the nucleic acids of the invention contained in the Sequence Listing. These uses are described 
in further detail below. 

25 Probes 

A nucleic acid isolated or synthesized in accordance with the sequence of the 
invention contained in the Sequence Listing can be used as a probe to specifically detect S. 
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epidermidis . With the sequence information set forth in the present application, sequences 
of twenty or more nucleotides are identified which provide the desired inclusivity and 
exclusivity with respect to S. epidermidis , and extraneous nucleic acids likely to be 
encountered during hybridization conditions. More preferably, the sequence will comprise at 
5 least about twenty to thirty nucleotides to convey stability to the hybridization product formed 
between the probe and the intended target molecules. 

Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be 
generated by recombinant DNA techniques. Individuals skilled in the art will readily 
recognize that the nucleic acids, for use as probes, can be provided with a label to facilitate 
1 0 detection of a hybridization product. 

Nucleic acid isolated and synthesized in accordance with the sequence of the invention 
contained in the Sequence Listing can also be useful as probes to detect homologous regions 
(especially homologous genes) of other Staphylococcus species using appropriate stringency 
hybridization conditions as described herein. 

15 

Capture Ligand 

For use as a capture ligand, the nucleic acid selected in the manner described above 
with respect to probes, can be readily associated with a support. The manner in which nucleic 
acid is associated with supports is well known. Nucleic acid having twenty or more 

20 nucleotides in a sequence of the invention contained in the Sequence Listing have utility to 
separate S. epidermidis nucleic acid from one strain from the nucleic acid of other another 
strain as well as from other organisms. Nucleic acid having twenty or more nucleotides in a 
sequence of the invention contained in the Sequence Listing can also have utility to separate 
other Staphylococcus species from each other and from other organisms. Preferably, the 

25 sequence will comprise at least about twenty nucleotides to convey stability to the 
hybridization product formed between the probe and the intended target molecules. 
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Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be 
generated by recombinant DNA techniques. 



Primers 

5 Nucleic acid isolated or synthesized in accordance with the sequences described herein 

have utility as primers for the amplification of S. epidermidis nucleic acid. These nucleic 
acids may also have utility as primers for the amplification of nucleic acids in other 
Staphylococcus species. With respect to polymerase chain reaction (PCR) techniques, 
nucleic acid sequences of > 10-15 nucleotides of the invention contained in the Sequence 

10 Listing have utility in conjunction with suitable enzymes and reagents to create copies of S. 
epidermidis nucleic acid. More preferably, the sequence will comprise twenty or more 
nucleotides to convey stability to the hybridization product formed between the primer and the 
intended target molecules. Binding conditions of primers greater than 100 nucleotides are 
more difficult to control to obtain specificity. High fidelity PCR can be used to ensure a 

15 faithful DNA copy prior to expression. In addition, amplified products can be checked by 
conventional sequencing methods. 

The copies can be used in diagnostic assays to detect specific sequences, including 
genes from S. epidermidis and/or other Staphylococcus species. The copies can also be 
incorporated into cloning and expression vectors to generate polypeptides corresponding to 

20 the nucleic acid synthesized by PCR, as is described in greater detail herein. 

The nucleic acids of the present invention find use as templates for the recombinant 
production of S. epidermidis -derived peptides or polypeptides 



Antisense 

25 Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in 

accordance with the sequences described herein have utility as antisense agents to prevent the 
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expression of S. epidermidis genes. These sequences also have utility as antisense agents to 

prevent expression of genes of other Staphylococcus species. 

In one embodiment, nucleic acid or derivatives corresponding to & epidermidis 

nucleic acids is loaded into a suitable carrier such as a liposome or bacteriophage for 
5 introduction into bacterial cells. For example, a nucleic acid having twenty or more 

nucleotides is capable of binding to bacteria nucleic acid or bacteria messenger RNA. 

Preferably, the antisense nucleic acid is comprised of 20 or more nucleotides to provide 

necessary stability of a hybridization product of non-naturally occurring nucleic acid and 

bacterial nucleic acid and/or bacterial messenger RNA. Nucleic acid having a sequence 
10 greater than 1000 nucleotides in length is difficult to synthesize but can be generated by 

recombinant DNA techniques. Methods for loading antisense nucleic acid in liposomes is 

known in the art as exemplified by U.S. Patent 4,241,046 issued December 23, 1980 to 

Papahadjopoulos et al. 

The present invention encompasses isolated polypeptides and nucleic acids derived 
15 from S. epidermidis that are useful as reagents for diagnosis of bacterial infection, 

components of effective anti-bacterial vaccines, and/or as targets for anti-bacterial drugs, 

including anti-S epidermidis drugs. 

Expression of S. epidermidis Nucleic Acids 
20 Table 2, which is appended herewith and which forms part of the present 

specification, provides a list of open reading frames (ORFs) in both strands and a putative 

identification of the particular function of a polypeptide which is encoded by each ORF, based 

on the homology match (determined by the BLASTP2 algorithm) of the predicted polypeptide 

with known proteins encoded by ORFs in other organisms. An ORF is a region of nucleic 

25 acid which encodes a polypeptide. This region may represent a portion of a coding sequence 

or a total sequence and was determined from stop to stop codons. The first column contains a 
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designation for the ORF ("ORF Name"). The second and third columns list the SEQ ID 
numbers for the nucleic acid ( M NT ID") and amino acid ("AA ID") sequences corresponding 
to each ORF, respectively. The fourth and fifth columns list the length of the nucleic acid 
ORF ("NT LN") and the length of the amino acid ORF ("AA LN"), respectively. The 
5 nucleotide sequence corresponding to each ORF begins at the first nucleotide immediately 
following a stop codon and ends at the nucleotide immediately preceding the next downstream 
stop codon in the same reading frame. It will be recognized by one skilled in the art that the 
natural translation initiation sites will correspond to ATG, GTG, or TTG codons located 
within the ORFs. The natural initiation sites depend not only on the sequence of a start codon 

10 but also on the context of the DNA sequence adjacent to the start codon. Usually, a 

recognizable ribosome binding site is found within 20 nucleotides upstream from the initiation 
codon. In some cases where genes are translationally coupled and coordinately expressed 
together in "operons", ribosome binding sites are not present, but the initiation codon of a 
downstream gene may occur very close to, or overlap, the stop codon of the an upstream gene 

1 5 in the same operon. The correct start codons can be generally identified without undue 
experimentation because only a few codons need be tested. It is recognized that the 
translational machinery in bacteria initiates all polypeptide chains with the amino acid 
methionine, regardless of the sequence of the start codon. In some cases, polypeptides are 
post-translationally modified, resulting in an N-terminal amino acid other than methionine in 

20 vivo. The sixth and seventh columns provide metrics for assessing the likelihood of the 

homology match (determined by the BLASTP2 algorithm), as is known in the art, to the genes 
indicated in the description frame ("Desciption") below the ORF Name. These genes in the 
Description were identified when the designated ORF was compared against a comprehensive 
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non-redundant protein database. Specifically, the sixth column represents the Blast Score 
("Score") for the match (a higher score is a better match), and the seventh column represents 
the probability ("P-value") for the match (the probability that such a match can have occurred 
by chance; the lower the value, the more likely the match is valid). If a BLASTP2 score of 
5 less than 100 was obtained, no value is reported in the table. The Description provides, where 
available, the Swissprot accession number (SP), the locus name (LN), the Organism (OR), 
Source of variant (SR), E.C. number (EC), the gene name (GN), the product name (PN), the 
Function Description (FN), Left End (LE), Right End (RE), Coding Direction (DI), and the 
description (DE) or notes (NT) for each ORF. This information allows one of ordinary skill 
10 in the art to determine a potential use for each identified coding sequence and, as a result, 
allows to use the polypeptides of the present invention for commercial and industrial 
purposes. 

Using the information provided in SEQ ID NO: 1 - SEQ ID NO: 3772, SEQ ID NO: 

3773 - SEQ ID NO: 7544 and in Table 2 together with routine cloning and sequencing 
15 methods, one of ordinary skill in the art will be able to clone and sequence all the nucleic acid 

fragments of interest including open reading frames (ORFs) encoding a large variety of 

proteins of S. epidermidis. 

Nucleic acid isolated or synthesized in accordance with the sequences described herein 

have utility to generate polypeptides. The nucleic acid of the invention exemplified in SEQ 
20 ID NO: 1 - SEQ ID NO: 3772 and in Table 2 or fragments of said nucleic acid encoding active 

portions of S. epidermidis polypeptides can be cloned into suitable vectors or used to isolate 

nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and cloned into 

a suitable vector. 
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The function of a specific gene or operon can be ascertained by expression in a 
bacterial strain under conditions where the activity of the gene product(s) specified by the 
gene or operon in question can be specifically measured. Alternatively, a gene product may 
be produced in large quantities in an expressing strain for use as an antigen, an industrial 
5 reagent, for structural studies, etc. This expression can be accomplished in a mutant strain 
which lacks the activity of the gene to be tested, or in a strain that does not produce the same 
gene product(s). This includes, but is not limited to, Eucaryotic species such as the yeast 
Saccharomyces cerevisiae, Methanobacterium strains or other Archaea, and Eubacteria such 
as E. coli, B. Subtilis, S. Aureus, S. Pneumonia or Pseudomonas putida. In some cases the 

10 expression host will utilize the natural S. epidermidis promoter whereas in others, it will be 
necessary to drive the gene with a promoter sequence derived from the expressing organism 
(e.g., an E. coli beta-galactosidase promoter for expression in E. coli). 

To express a gene product using the natural S. epidermidis promoter, a procedure such 
as the following can be used. A restriction fragment containing the gene of interest, together 

15 with its associated natural promoter element and regulatory sequences (identified using the 
DNA sequence data) is cloned into an appropriate recombinant plasmid containing an origin 
of replication that functions in the host organism and an appropriate selectable marker. This 
can be accomplished by a number of procedures known to those skilled in the art. It is most 
preferably done by cutting the plasmid and the fragment to be cloned with the same restriction 

20 enzyme to produce compatible ends that can be ligated to join the two pieces together. The 
recombinant plasmid is introduced into the host organism by, for example, electroporation and 
cells containing the recombinant plasmid are identified by selection for the marker on the 
plasmid. Expression of the desired gene product is detected using an assay specific for that 
gene product. 

25 In the case of a gene that requires a different promoter, the body of the gene (coding 

sequence) is specifically excised and cloned into an appropriate expression plasmid. This 
subcloning can be done by several methods, but is most easily accomplished by PCR 
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amplification of a specific fragment and ligation into an expression plasmid after treating the 
PCR product with a restriction enzyme or exonuclease to create suitable ends for cloning. 

A suitable host cell for expression of a gene can be any procaryotic or eucaryotic cell. 
Suitable methods for transforming host cells can be found in Sambrook et al. ( Molecular 
5 Cloning: A Laboratory Manual 2nd Edition, Cold Spring Harbor Laboratory Press (1989)), 
and other laboratory textbooks. 

For example, a host cell transfected with a nucleic acid vector directing expression of a 
nucleotide sequence encoding an S. epidermidis polypeptide can be cultured under 
appropriate conditions to allow expression of the polypeptide to occur. Suitable media for cell 

10 culture are well known in the art. Polypeptides of the invention can be isolated from cell 
culture medium, host cells, or both using techniques known in the art for purifying proteins 
including ion-exchange chromatography, gel filtration chromatography, ultrafiltration, 
electrophoresis, and immunoaffinity purification with antibodies specific for such 
polypeptides. Additionally, in many situations, polypeptides can be produced by chemical 

1 5 cleavage of a native protein (e.g., tryptic digestion) and the cleavage products can then be 
purified by standard techniques. 

In the case of membrane bound proteins, these can be isolated from a host cell by 
contacting a membrane-associated protein fraction with a detergent forming a solubilized 
complex, where the membrane-associated protein is no longer entirely embedded in the 

20 membrane fraction and is solubilized at least to an extent which allows it to be 

chromatographically isolated from the membrane fraction. Chromatographic techniques 
which can be used in the final purification step are known in the art and include hydrophobic 
interaction, lectin affinity, ion exchange, dye affinity and immunoaffinity. 

One strategy to maximize recombinant S. epidermidis peptide expression in E. coli is 

25 to express the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in Enzymology 
185 , Academic Press, San Diego, California (1990) 1 19-128). Another strategy would be to 
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alter the nucleic acid encoding an & epidermidis peptide to be inserted into an expression 
vector so that the individual codons for each amino acid would be those preferentially utilized 
in highly expressed E. coli proteins (Wada et al, (1992) Nuc. Acids Res. 20:21 1 1-21 18). Such 
alteration of nucleic acids of the invention can be carried out by standard DNA synthesis 
5 techniques. 

The nucleic acids of the invention can also be chemically synthesized using standard 
techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, 
including solid-phase synthesis which, like peptide synthesis, has been fully automated in 
commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. Patent No. 4,598,049; 
10 Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos. 4,401,796 and 
4,373,071, incorporated by reference herein). 

The present invention provides a library of S. epidermidis -derived nucleic acid 
sequences. The libraries provide probes, primers, and markers which can be used as markers 
in epidemiological studies. The present invention also provides a library of S. epidermidis - 
1 5 derived nucleic acid sequences which comprise or encode targets for therapeutic drugs. 

Nucleic acids comprising any of the sequences disclosed herein or sub-sequences 
thereof can be prepared by standard methods using the nucleic acid sequence information 
provided in SEQ ID NO: 1 - SEQ ID NO: 3772. For example, DNA can be chemically 
synthesized using, e.g., the phosphoramidite solid support method of Matteucci et al, 1981, J. 
20 Am. Chem. Soc. 103:3185, the method of Yoo et al, 1989, J. Biol Chem. 764:17078, or other 
well known methods. This can be done by sequentially linking a series of oligonucleotide 
cassettes comprising pairs of synthetic oligonucleotides, as described below. 

Of course, due to the degeneracy of the genetic code, many different nucleotide 
sequences can encode polypeptides having the amino acid sequences defined by SEQ ID NO: 
25 3773 - SEQ ID NO: 7544 or sub-sequences thereof. The codons can be selected for optimal 
expression in prokaryotic or eukaryotic systems. Such degenerate variants are also 
encompassed by this invention. 
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Insertion of nucleic acids (typically DNAs) encoding the polypeptides of the invention 
into a vector is easily accomplished when the termini of both the DNAs and the vector 
comprise compatible restriction sites. If this cannot be done, it may be necessary to modify 
the termini of the DNAs and/or vector by digesting back single-stranded DNA overhangs 
5 generated by restriction endonuclease cleavage to produce blunt ends, or to achieve the same 
result by filling in the single-stranded termini with an appropriate DNA polymerase. 

Alternatively, any site desired may be produced, e.g., by ligating nucleotide sequences 
(linkers) onto the termini. Such linkers may comprise specific oligonucleotide sequences that 
define desired restriction sites. Restriction sites can also be generated by the use of the 

10 polymerase chain reaction (PCR). See, e.g., Saiki et aL, 1988, Science 239:48. The cleaved 
vector and the DNA fragments may also be modified if required by homopolymeric tailing. 

The nucleic acids of the invention may be isolated directly from cells. Alternatively, 
the polymerase chain reaction (PCR) method can be used to produce the nucleic acids of the 
invention, using either chemically synthesized strands or genomic material as templates. 

1 5 Primers used for PCR can be synthesized using the sequence information provided herein and 
can further be designed to introduce appropriate new restriction sites, if desirable, to facilitate 
incorporation into a given vector for recombinant expression. 

The nucleic acids of the present invention may be flanked by natural S. epidermidis 
regulatory sequences, or may be associated with heterologous sequences, including promoters, 

20 enhancers, response elements, signal sequences, polyadenylation sequences, introns, 5'- and 
3 f - noncoding regions, and the like. The nucleic acids may also be modified by many means 
known in the art. Non-limiting examples of such modifications include methylation, "caps", 
substitution of one or more of the naturally occurring nucleotides with an analog, 
internucleotide modifications such as, for example, those with uncharged linkages (e.g., 

25 methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with 
charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.). Nucleic acids may 
contain one or more additional covalently linked moieties, such as, for example, proteins (e.g., 
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nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, 
psoralen, etc.), chelators (e.g., metals, radioactive metals, iron, oxidative metals, etc.), and 
alkylators. PNAs are also included. The nucleic acid may be derivatized by formation of a 
methyl or ethyl phosphotriester or an alkyl phosphoramidate linkage. Furthermore, the 
5 nucleic acid sequences of the present invention may also be modified with a label capable of 
providing a detectable signal, either directly or indirectly. Exemplary labels include 
radioisotopes, fluorescent molecules, biotin, and the like. 

The invention also provides nucleic acid vectors comprising the disclosed S. 
epidermidis -derived sequences or derivatives or fragments thereof. A large number of 
10 vectors, including plasmid and bacterial vectors, have been described for replication and/or 
expression in a variety of eukaryotic and prokaryotic hosts, and may be used for cloning or 
protein expression. 

The encoded S. epidermidis polypeptides may be expressed by using many known 
vectors, such as pUC plasmids, pET plasmids (Novagen, Inc., Madison, WI), or pRSET or 

15 pREP (Invitrogen, San Diego, CA), and many appropriate host cells, using methods disclosed 
or cited herein or otherwise known to those skilled in the relevant art. The particular choice 
of vector/host is not critical to the practice of the invention. 

Recombinant cloning vectors will often include one or more replication systems for 
cloning or expression, one or more markers for selection in the host, e.g. antibiotic resistance, 

20 and one or more expression cassettes. The inserted S. epidermidis coding sequences may be 
synthesized by standard methods, isolated from natural sources, or prepared as hybrids, etc. 
Ligation of the S. epidermidis coding sequences to transcriptional regulatory elements and/or 
to other amino acid coding sequences may be achieved by known methods. Suitable host cells 
may be transformed/transfected/infected as appropriate by any suitable method including 

25 electroporation, CaC^ mediated DNA uptake, bacterial infection, microinjection, 
microprojectile, or other established methods. 
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Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, and plant 
and animal cells, especially mammalian cells. Of particular interest are S. epidermidis , E. 
coli, B. Subtilis, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, 
Schizosaccharomyces pombi, SF9 cells, CI 29 cells, 293 cells, Neurospora, and CHO cells, 
5 COS cells, HeLa cells, and immortalized mammalian myeloid and lymphoid cell lines. 
Preferred replication systems include Ml 3, ColEl, SV40, baculovirus, lambda, adenovirus, 
and the like. A large number of transcription initiation and termination regulatory regions 
have been isolated and shown to be effective in the transcription and translation of 
heterologous proteins in the various hosts. Examples of these regions, methods of isolation, 
10 manner of manipulation, etc. are known in the art. Under appropriate expression conditions, 
host cells can be used as a source of recombinantly produced S. epidermidis -derived peptides 
and polypeptides. 

Advantageously, vectors may also include a transcription regulatory element (i.e., a 
promoter) operably linked to the S. epidermidis portion. The promoter may optionally 

1 5 contain operator portions and/or ribosome binding sites. Non-limiting examples of bacterial 
promoters compatible with E. coli include: b-lactamase (penicillinase) promoter; lactose 
promoter; tryptophan (tip) promoter; araBAD (arabinose) operon promoter; lambda-derived Pi 
promoter and N gene ribosome binding site; and the hybrid tac promoter derived from 
sequences of the trp and lac UV5 promoters. Non-limiting examples of yeast promoters 

20 include 3-phosphoglycerate kinase promoter, glyceraldehyde-3-phosphate dehydrogenase 

(GAPDH) promoter, galactokinase (GAL1) promoter, galactoepimerase promoter, and alcohol 
dehydrogenase (ADH) promoter. Suitable promoters for mammalian cells include without 
limitation viral promoters such as that from Simian Virus 40 (SV40), Rous sarcoma virus 
(RSV), adenovirus (ADV), and bovine papilloma virus (BPV). Mammalian cells may also 

25 require terminator sequences, polyA addition sequences and enhancer sequences to increase 
expression. Sequences which cause amplification of the gene may also be desirable. 
Furthermore, sequences that facilitate secretion of the recombinant product from cells, 
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including, but not limited to, bacteria, yeast, and animal cells, such as secretory signal 
sequences and/or prohormone pro region sequences, may also be included. These sequences 
are well described in the art. 

Nucleic acids encoding wild-type or variant S. epidermidis -derived polypeptides may 
5 also be introduced into cells by recombination events. For example, such a sequence can be 
introduced into a cell, and thereby effect homologous recombination at the site of an 
endogenous gene or a sequence with substantial identity to the gene. Other recombination- 
based methods such as nonhomologous recombinations or deletion of endogenous genes by 
homologous recombination may also be used. 
10 The nucleic acids of the present invention find use as templates for the recombinant 

production of S. epidermidis -derived peptides or polypeptides. 

Identification and Use of S. epidermidis Nucleic Acid Sequences 

The disclosed S. epidermidis polypeptide and nucleic acid sequences, or other 

15 sequences that are contained within ORFs, including complete protein-coding sequences, of 
which any of the disclosed S. epidermidis -specific sequences forms a part, are useful as 
target components for diagnosis and/or treatment of S. epidermidis - caused infection 

It will be understood that the sequence of an entire protein-coding sequence of which 
each disclosed nucleic acid sequence forms a part can be isolated and identified based on each 

20 disclosed sequence. This can be achieved, for example, by using an isolated nucleic acid 
encoding the disclosed sequence, or fragments thereof, to prime a sequencing reaction with 
genomic S. epidermidis DNA as template; this is followed by sequencing the amplified 
product. The isolated nucleic acid encoding the disclosed sequence, or fragments thereof, can 
also be hybridized to S. epidermidis genomic libraries to identify clones containing additional 

25 complete segments of the protein-coding sequence of which the shorter sequence forms a part. 
Then, the entire protein-coding sequence, or fragments thereof, or nucleic acids encoding all 
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or part of the sequence, or sequence-conservative or function-conservative variants thereof, 
may be employed in practicing the present invention. 

Preferred sequences are those that are useful in diagnostic and/or therapeutic 
applications. Diagnostic applications include without limitation nucleic-acid-based and 
5 antibody-based methods for detecting bacterial infection. Therapeutic applications include 
without limitation vaccines, passive immunotherapy, and drug treatments directed against 
gene products that are both unique to bacteria and essential for growth and/or replication of 
bacteria. 

10 Identification of Nucleic Acids Encoding Vaccine Components and Targets for Agents 
Effective Against S. epidermidis 

The disclosed S. epidermidis genome sequence includes segments that direct the 
synthesis of ribonucleic acids and polypeptides, as well as origins of replication, promoters, 
other types of regulatory sequences, and intergenic nucleic acids. The invention encompasses 

15 nucleic acids encoding immunogenic components of vaccines and targets for agents effective 
against S. epidermidis . Identification of said immunogenic components involved in the 
determination of the function of the disclosed sequences, which can be achieved using a 
variety of approaches. Non-limiting examples of these approaches are described briefly 
below. 

20 

Homology to known sequences: 

Computer-assisted comparison of the disclosed S. epidermidis sequences with 
previously reported sequences present in publicly available databases is useful for identifying 
functional S. epidermidis nucleic acid and polypeptide sequences. It will be understood that 
25 protein-coding sequences, for example, may be compared as a whole, and that a high degree 
of sequence homology between two proteins (such as, for example, >80-90%) at the amino 
acid level indicates that the two proteins also possess some degree of functional homology, 
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such as, for example, among enzymes involved in metabolism, DNA synthesis, or cell wall 
synthesis, and proteins involved in transport, cell division, etc. In addition, many structural 
features of particular protein classes have been identified and correlate with specific 
consensus sequences, such as, for example, binding domains for nucleotides, DNA, metal 
5 ions, and other small molecules; sites for covalent modifications such as phosphorylation, 
acylation, and the like; sites of proteimprotein interactions, etc. These consensus sequences 
may be quite short and thus may represent only a fraction of the entire protein-coding 
sequence. Identification of such a feature in an S. epidermidis sequence is therefore useful in 
determining the function of the encoded protein and identifying useful targets of antibacterial 
10 drugs. 

Of particular relevance to the present invention are structural features that are common 
to secretory, transmembrane, and surface proteins, including secretion signal peptides and 
hydrophobic transmembrane domains. S. epidermidis proteins identified as containing 
putative signal sequences and/or transmembrane domains are useful as immunogenic 

1 5 components of vaccines. 

Targets for therapeutic drugs according to the invention include, but are not limited to, 
polypeptides of the invention, whether unique to S. epidermidis or not, that are essential for 
growth and/or viability of S. epidermidis under at least one growth condition. Polypeptides 
essential for growth and/or viability can be determined by examining the effect of deleting 

20 and/or disrupting the genes, i.e., by so-called gene "knockout". Alternatively, genetic 
footprinting can be used (Smith et al, 1995, Proc. Natl Acad ScL USA 92:5479-6433; 
Published International Application WO 94/26933; U.S. Patent No. 5,612,180). Still other 
methods for assessing essentiality includes the ability to isolate conditional lethal mutations in 
the specific gene (e.g., temperature sensitive mutations). Other useful targets for therapeutic 

25 drugs, which include polypeptides that are not essential for growth or viability per se but lead 
to loss of viability of the cell, can be used to target therapeutic agents to cells. 
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Strain-specific sequences: 

Because of the evolutionary relationship between different S. epidermidis strains, it is 
believed that the presently disclosed & epidermidis sequences are useful for identifying, 
and/or discriminating between, previously known and new S. epidermidis strains. It is 
5 believed that other S. epidermidis strains will exhibit at least about 70% sequence homology 
with the presently disclosed sequence. Systematic and routine analyses of DNA sequences 
derived from samples containing S. epidermidis strains, and comparison with the present 
sequence allows for the identification of sequences that can be used to discriminate between 
strains, as well as those that are common to all S. epidermidis strains. In one embodiment, 
10 the invention provides nucleic acids, including probes, and peptide and polypeptide sequences 
that discriminate between different strains of S. epidermidis . Strain-specific components can 
also be identified functionally by their ability to elicit or react with antibodies that selectively 
recognize one or more S. epidermidis strains. 

In another embodiment, the invention provides nucleic acids, including probes, and 
15 peptide and polypeptide sequences that are common to all S. epidermidis strains but are not 
found in other bacterial species. 

S. epidermidis Polypeptides 

This invention encompasses isolated S. epidermidis polypeptides encoded by the 

20 disclosed S. epidermidis genomic sequences, including the polypeptides of the invention 
contained in the Sequence Listing. Polypeptides of the invention are preferably at least about 
5 amino acid residues in length. Using the DNA sequence information provided herein, the 
amino acid sequences of the polypeptides encompassed by the invention can be deduced using 
methods well-known in the art. It will be understood that the sequence of an entire nucleic 

25 acid encoding an S. epidermidis polypeptide can be isolated and identified based on an ORF 
that encodes only a fragment of the cognate protein-coding region. This can be achieved, for 
example, by using the isolated nucleic acid encoding the ORF, or fragments thereof, to prime 
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a polymerase chain reaction with genomic S. epidermidis DNA as template; this is followed 
by sequencing the amplified product. 

The polypeptides of the present invention, including function-conservative variants of 
the disclosed ORFs, may be isolated from wild-type or mutant S. epidermidis cells, or from 
5 heterologous organisms or cells (including, but not limited to, bacteria, fungi, insect, plant, 
and mammalian cells) including S. epidermidis into which an S. epidermidis -derived 
protein-coding sequence has been introduced and expressed. Furthermore, the polypeptides 
may be part of recombinant fusion proteins. 

S. epidermidis polypeptides of the invention can be chemically synthesized using 

10 commercially automated procedures such as those referenced herein , including, without 
limitation, exclusive solid phase synthesis, partial solid phase methods, fragment 
condensation or classical solution synthesis. The polypeptides are preferably prepared by 
solid phase peptide synthesis as described by Merrifield, 1963, J. Am. Chem. Soc. 85:2149. 
The synthesis is carried out with amino acids that are protected at the alpha-amino terminus. 

1 5 Trifunctional amino acids with labile side-chains are also protected with suitable groups to 
prevent undesired chemical reactions from occurring during the assembly of the polypeptides. 
The alpha-amino protecting group is selectively removed to allow subsequent reaction to take 
place at the amino-terminus. The conditions for the removal of the alpha-amino protecting 
group do not remove the side-chain protecting groups. 

20 Methods for polypeptide purification are well-known in the art, including, without 

limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase 
HPLC, gel filtration, ion exchange and partition chromatography, and countercurrent 
distribution. For some purposes, it is preferable to produce the polypeptide in a recombinant 
system in which the S. epidermidis protein contains an additional sequence tag that facilitates 

25 purification, such as, but not limited to, a polyhistidine sequence. The polypeptide can then 
be purified from a crude lysate of the host cell by chromatography on an appropriate solid- 
phase matrix. Alternatively, antibodies produced against an & epidermidis protein or against 
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peptides derived therefrom can be used as purification reagents. Other purification methods 
are possible. 

The present invention also encompasses derivatives and homologues of S. epidermidis 
-encoded polypeptides. For some purposes, nucleic acid sequences encoding the peptides may 
5 be altered by substitutions, additions, or deletions that provide for functionally equivalent 
molecules, i.e., function-conservative variants. For example, one or more amino acid residues 
within the sequence can be substituted by another amino acid of similar properties, such as, 
for example, positively charged amino acids (arginine, lysine, and histidine); negatively 
charged amino acids (aspartate and glutamate); polar neutral amino acids; and non-polar 
10 amino acids. 

The isolated polypeptides may be modified by, for example, phosphorylation, 
sulfation, acylation, or other protein modifications. They may also be modified with a label 
capable of providing a detectable signal, either directly or indirectly, including, but not limited 
to, radioisotopes and fluorescent compounds. 

15 To identify S. epidermidis -derived polypeptides for use in the present invention, 

essentially the complete genomic sequence of a virulent, methicillin-resistant isolate of 
Staphylococcus mirabilis isolate was analyzed. While, in very rare instances, a nucleic acid 
sequencing error may be revealed, resolving a rare sequencing error is well within the art, and 
such an occurrence will not prevent one skilled in the art from practicing the invention. 

20 Also encompassed are any S. epidermidis polypeptide sequences that are contained 

within the open reading frames (ORFs), including complete protein-coding sequences, of 
which any of SEQ ID NO: 1 - SEQ ID NO: 3772 forms a part. Table 2, which is appended 
herewith and which forms part of the present specification, provides a putative identification 
of the particular function of a polypeptide which is encoded by each ORF, based on the 

25 homology match (determined by the BLAST algorithm) of the predicted polypeptide with 
known proteins encoded by ORFs in other organisms. As a result, one skilled in the art can 
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use the polypeptides of the present invention for commercial and industrial purposes 
consistent with the type of putative identification of the polypeptide. 

The present invention provides a library of S. epidermidis -derived polypeptide 
sequences, and a corresponding library of nucleic acid sequences encoding the polypeptides, 
5 wherein the polypeptides themselves, or polypeptides contained within ORFs of which they 
form a part, comprise sequences that are contemplated for use as components of vaccines. 
Non-limiting examples of such sequences are listed by SEQ ID NO in Table 2, which is 
appended herewith and which forms part of the present specification. 

The present invention also provides a library of S. epidermidis -derived polypeptide 

10 sequences, and a corresponding library of nucleic acid sequences encoding the polypeptides, 
wherein the polypeptides themselves, or polypeptides contained within ORFs of which they 
form a part, comprise sequences lacking homology to any known prokaryotic or eukaryotic 
sequences. Such libraries provide probes, primers, and markers which can be used to 
diagnose S. epidermidis infection, including use as markers in epidemiological studies. Non- 

15 limiting examples of such sequences are listed by SEQ ID NO in Table 2, which is appended 
hereto and part hereof. 

The present invention also provides a library of S. epidermidis -derived polypeptide 
sequences, and a corresponding library of nucleic acid sequences encoding the polypeptides, 
wherein the polypeptides themselves, or polypeptides contained within ORFs of which they 

20 form a part, comprise targets for therapeutic drugs. 



Specific Example: Determination Of Staphylococcus Protein Antigens For Antibody And 
Vaccine Development 

The selection of Staphylococcus protein antigens for vaccine development can be 
25 derived from the nucleic acids encoding S. epidermidis polypeptides. First, the ORF f s can be 
analyzed for homology to other known exported or membrane proteins and analyzed using the 
discriminant analysis described by Klein, et al (Klein, P., Kanehsia, M., and DeLisi, C. 

49 



Attorney Docket: PATH03-16 



(1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported and membrane 
proteins. 

Homology searches can be performed using the BLAST algorithm contained in the 

Wisconsin Sequence Analysis Package (Genetics Computer Group, University Research Park, 

5 575 Science Drive, Madison, WI 5371 1) to compare each predicted ORF amino acid sequence 

with all sequences found in the current GenBank, SWISS-PROT and PIR databases. BLAST 

searches for local alignments between the ORF and the databank sequences and reports a 

probability score which indicates the probability of finding this sequence by chance in the 

-6 

database. ORFs with significant homology (e.g. probabilities lower than 1x10 that the 
10 homology is only due to random chance) to membrane or exported proteins represent protein 
antigens for vaccine development. Possible functions can be provided to S. epidermidis 
genes based on sequence homology to genes cloned in other organisms. 

Discriminant analysis (Klein, et al. supra) can be used to examine the ORF amino acid 
sequences. This algorithm uses the intrinsic information contained in the ORF amino acid 
15 sequence and compares it to information derived from the properties of known membrane arid 
exported proteins. This comparison predicts which proteins will be exported, membrane 
associated or cytoplasmic. ORF amino acid sequences identified as exported or membrane 
associated by this algorithm are likely protein antigens for vaccine development. 

20 Production of Fragments and Analogs of S. epidermidis Nucleic Acids and Polypeptides 

Based on the discovery of the S. epidermidis gene products of the invention provided 
in the Sequence Listing, one skilled in the art can alter the disclosed structure of S. 
epidermidis genes, e.g., by producing fragments or analogs, and test the newly produced 
structures for activity. Examples of techniques known to those skilled in the relevant art 

25 which allow the production and testing of fragments and analogs are discussed below. These, 
or analogous methods can be used to make and screen libraries of polypeptides, e.g., libraries 
of random peptides or libraries of fragments or analogs of cellular proteins for the ability to 
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bind S. epidermidis polypeptides. Such screens are useful for the identification of inhibitors 
of S. epidermidis . 

Generation of Fragments 
5 Fragments of a protein can be produced in several ways, e.g., recombinantly, by 

proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a 
polypeptide can be generated by removing one or more nucleotides from one end (for a 
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes the 
polypeptide. Expression of the mutagenized DNA produces polypeptide fragments. 

10 Digestion with "end-nibbling" endonucleases can thus generate DNAs which encode an array 
of fragments. DNAs which encode fragments of a protein can also be generated by random 
shearing, restriction digestion or a combination of the above-discussed methods. 

Fragments can also be chemically synthesized using techniques known in the art such 
as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, peptides of 

1 5 the present invention may be arbitrarily divided into fragments of desired length with no 
overlap of the fragments, or divided into overlapping fragments of a desired length. 

Alteration of Nucleic Acids and Polypeptides: Random Methods 

Amino acid sequence variants of a protein can be prepared by random mutagenesis of 
20 DNA which encodes a protein or a particular domain or region of a protein. Useful methods 
include PCR mutagenesis and saturation mutagenesis. A library of random amino acid 
sequence variants can also be generated by the synthesis of a set of degenerate oligonucleotide 
sequences. (Methods for screening proteins in a library of variants are elsewhere herein). 

25 PCR Mutagenesis 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce random 
mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). The DNA 
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region to be mutagenized is amplified using the polymerase chain reaction (PCR) under 

conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerase, e.g., by using a 

2+ 

dGTP/dATP ratio of five and adding Mn to the PCR reaction. The pool of amplified DNA 
fragments are inserted into appropriate cloning vectors to provide random mutant libraries. 

5 

Saturation Mutagenesis 

Saturation mutagenesis allows for the rapid introduction of a large number of single 
base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 229:242). This 
technique includes generation of mutations, e.g., by chemical treatment or irradiation of 

10 single-stranded DNA in vitro, and synthesis of a complimentary DNA strand. The mutation 
frequency can be modulated by modulating the severity of the treatment, and essentially all 
possible base substitutions can be obtained. Because this procedure does not involve a genetic 
selection for mutant fragments both neutral substitutions, as well as those that alter function, 
are obtained. The distribution of point mutations is not biased toward conserved sequence 

15 elements. 

Degenerate Oligonucleotides 

A library of homologs can also be generated from a set of degenerate oligonucleotide 
sequences. Chemical synthesis of a degenerate sequences can be carried out in an automatic 
20 DNA synthesizer, and the synthetic genes then ligated into an appropriate expression vector. 
The synthesis of degenerate oligonucleotides is known in the art (see for example, Narang, SA 

(1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland 
Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273-289; Itakura et al. 

(1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) 
25 Nucleic Acid Res. 1 1 :477. Such techniques have been employed in the directed evolution of 

other proteins (see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) 
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PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 
87: 6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

Alteration of Nucleic Acids and Polypeptides: Methods for Directed Mutagenesis 
5 Non-random or directed, mutagenesis techniques can be used to provide specific 

sequences or mutations in specific regions. These techniques can be used to create variants 
which include, e.g., deletions, insertions, or substitutions, of residues of the known amino acid 
sequence of a protein. The sites for mutation can be modified individually or in series, e.g., 
by (1) substituting first with conserved amino acids and then with more radical choices 
10 depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of 
the same or a different class adjacent to the located site, or combinations of options 1-3. 

Alanine Scanning Mutagenesis 

Alanine scanning mutagenesis is a useful method for identification of certain residues 
1 5 or regions of the desired protein that are preferred locations or domains for mutagenesis, 
Cunningham and Wells {Science 244:1081-1085, 1989). In alanine scanning, a residue or 
group of target residues are identified (e.g., charged residues such as Arg, Asp, His, Lys, and 
Glu) and replaced by a neutral or negatively charged amino acid (most preferably alanine or 
polyalanine). Replacement of an amino acid can affect the interaction of the amino acids with 
20 the surrounding aqueous environment in or outside the cell. Those domains demonstrating 
functional sensitivity to the substitutions are then refined by introducing further or other 
variants at or for the sites of substitution. Thus, while the site for introducing an amino acid 
sequence variation is predetermined, the nature of the mutation per se need not be 
predetermined. For example, to optimize the performance of a mutation at a given site, 
25 alanine scanning or random mutagenesis may be conducted at the target codon or region and 
the expressed desired protein subunit variants are screened for the optimal combination of 
desired activity. 



53 



Attorney Docket: PATH03-16 



Oligonucleotide-Mediated Mutagenesis 

Oligonucleotide-mediated mutagenesis is a useful method for preparing substitution, 
deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 2:183, 1983). 
5 Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a mutation to a 
DNA template, where the template is the single-stranded form of a plasmid or bacteriophage 
containing the unaltered or native DNA sequence of the desired protein. After hybridization, 
a DNA polymerase is used to synthesize an entire second complementary strand of the 
template that will thus incorporate the oligonucleotide primer, and will code for the selected 

10 alteration in the desired protein DNA. Generally, oligonucleotides of at least about 25 

nucleotides in length are used. An optimal oligonucleotide will have 12 to 15 nucleotides that 
are completely complementary to the template on either side of the nucleotide(s) coding for 
the mutation. This ensures that the oligonucleotide will hybridize properly to the single- 
stranded DNA template molecule. The oligonucleotides are readily synthesized using 

15 techniques known in the art such as that described by Crea et al. (Proc. Natl Acad Sci. USA, 
75: 5765[1978]). 

Cassette Mutagenesis 

Another method for preparing variants, cassette mutagenesis, is based on the technique 
20 described by Wells et al. (Gene, 34:315[1985]). The starting material is a plasmid (or other 
vector) which includes the protein subunit DNA to be mutated. The codon(s) in the protein 
subunit DNA to be mutated are identified. There must be a unique restriction endonuclease 
site on each side of the identified mutation site(s). If no such restriction sites exist, they may 
be generated using the above-described oligonucleotide-mediated mutagenesis method to 
25 introduce them at appropriate locations in the desired protein subunit DNA. After the 
restriction sites have been introduced into the plasmid, the plasmid is cut at these sites to 
linearize it. A double-stranded oligonucleotide encoding the sequence of the DNA between 
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the restriction sites but containing the desired mutation(s) is synthesized using standard 
procedures. The two strands are synthesized separately and then hybridized together using 
standard techniques. This double-stranded oligonucleotide is referred to as the cassette. This 
cassette is designed to have 3* and 5' ends that are comparable with the ends of the linearized 
5 plasmid, such that it can be directly ligated to the plasmid. This plasmid now contains the 
mutated desired protein subunit DNA sequence. 

Combinatorial Mutagenesis 

Combinatorial mutagenesis can also be used to generate mutants (Ladner et aL, WO 

10 88/06630). In this method, the amino acid sequences for a group of homologs or other related 
proteins are aligned, preferably to promote the highest homology possible. All of the amino 
acids which appear at a given position of the aligned sequences can be selected to create a 
degenerate set of combinatorial sequences. The variegated library of variants is generated by 
combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene 

1 5 library. For example, a mixture of synthetic oligonucleotides can be enzymatically ligated 
into gene sequences such that the degenerate set of potential sequences are expressible as 
individual peptides, or alternatively, as a set of larger fusion proteins containing the set of 
degenerate sequences. 

20 Other Modifications of S. epidermidis Nucleic Acids and Polypeptides 

It is possible to modify the structure of an S. epidermidis polypeptide for such 
purposes as increasing solubility, enhancing stability (e.g., shelf life ex vivo and resistance to 
proteolytic degradation in vivo). A modified S. epidermidis protein or peptide can be 
produced in which the amino acid sequence has been altered, such as by amino acid 
25 substitution, deletion, or addition as described herein. 

An S. epidermidis peptide can also be modified by substitution of cysteine residues 
preferably with alanine, serine, threonine, leucine or glutamic acid residues to minimize 
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dimerization via disulfide linkages. In addition, amino acid side chains of fragments of the 
protein of the invention can be chemically modified. Another modification is cyclization of 
the peptide. 

In order to enhance stability and/or reactivity, an S. epidermidis polypeptide can be 
5 modified to incorporate one or more polymorphisms in the amino acid sequence of the protein 
resulting from any natural allelic variation. Additionally, D-amino acids, non-natural amino 
acids, or non-amino acid analogs can be substituted or added to produce a modified protein 
within the scope of this invention. Furthermore, an S. epidermidis polypeptide can be 
modified using polyethylene glycol (PEG) according to the method of A. Sehon and co- 

10 workers (Wie et al., supra) to produce a protein conjugated with PEG. In addition, PEG can 
be added during chemical synthesis of the protein. Other modifications of S. epidermidis 
proteins include reduction/alkylation (Tarr, Methods of Protein Microcharacterization, J. E. 
Silver ed., Humana Press, Clifton NJ 155-194 (1986)); acylation (Tarr, supra); chemical 
coupling to an appropriate carrier (Mishell and Shiigi, eds, Selected Methods in Cellular 

1 5 Immunology, WH Freeman, San Francisco, CA (1 980), U.S. Patent 4,939,239; or mild 

formalin treatment (Marsh, (1971) Int. Arch of Allergy andAppl. Immunol, 4J_: 199 - 215). 

To facilitate purification and potentially increase solubility of an S. epidermidis 
protein or peptide, it is possible to add an amino acid fusion moiety to the peptide backbone. 
For example, hexa-histidine can be added to the protein for purification by immobilized metal 

20 ion affinity chromatography (Hochuli, E. et al., (1988) Bio/Technology, 6: 1321 - 1325). In 
addition, to facilitate isolation of peptides free of irrelevant sequences, specific endoprotease 
cleavage sites can be introduced between the sequences of the fusion moiety and the peptide. 

To potentially aid proper antigen processing of epitopes within an S. epidermidis 
polypeptide, canonical protease sensitive sites can be engineered between regions, each 

25 comprising at least one epitope via recombinant or synthetic methods. For example, charged 
amino acid pairs, such as KK or RR, can be introduced between regions within a protein or 
fragment during recombinant construction thereof. The resulting peptide can be rendered 
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sensitive to cleavage by cathepsin and/or other trypsin-like enzymes which would generate 
portions of the protein containing one or more epitopes. In addition, such charged amino acid 
residues can result in an increase in the solubility of the peptide. 

5 Primary Methods for Screening Polypeptides and Analogs 

Various techniques are known in the art for screening generated mutant gene products. 
Techniques for screening large gene libraries often include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the genes under conditions in which detection of a desired activity, 
10 e.g., in this case, binding to S. epidermidis polypeptide or an interacting protein, facilitates 
relatively easy isolation of the vector encoding the gene whose product was detected. Each of 
the techniques described below is amenable to high through-put analysis for screening large 
numbers of sequences created, e.g., by random mutagenesis techniques. 

15 Two Hybrid Systems 

Two hybrid assays such as the system described below (as with the other screening 
methods described herein), can be used to identify polypeptides, e.g., fragments or analogs of 
a naturally-occurring S. epidermidis polypeptide, e.g., of cellular proteins, or of randomly 
generated polypeptides which bind to an S. epidermidis protein. (The S. epidermidis 

20 domain is used as the bait protein and the library of variants are expressed as prey fusion 
proteins.) In an analogous fashion, a two hybrid assay (as with the other screening methods 
described herein), can be used to find polypeptides which bind an & epidermidis polypeptide. 

Display Libraries 

25 In one approach to screening assays, the Staphylococcus peptides are displayed on the 

surface of a cell or viral particle, and the ability of particular cells or viral particles to bind an 
appropriate receptor protein via the displayed product is detected in a "panning assay". For 

57 



Attorney Docket: PATH03-16 



example, the gene library can be cloned into the gene for a surface membrane protein of a 
bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 
88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 
18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for 
5 potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can be 
used to detect homologs which retain ligand-binding activity. The use of fluorescently labeled 
ligands, allows cells to be visually inspected and separated under a fluorescence microscope, 
or, where the morphology of the cell permits, to be separated by a fluorescence-activated cell 
sorter. 

10 A gene library can be expressed as a fusion protein on the surface of a viral particle. 

For instance, in the filamentous phage system, foreign peptide sequences can be expressed on 
the surface of infectious phage, thereby conferring two significant benefits. First, since these 
phage can be applied to affinity matrices at concentrations well over 10 phage per milliliter, 

a large number of phage can be screened at one time. Second, since each infectious phage 
15 displays a gene product on its surface, if a particular phage is recovered from an affinity 
matrix in low yield, the phage can be amplified by another round of infection. The group of 
almost identical E. coli filamentous phages, Ml 3, fd., and fl, are most often used in phage 
display libraries. Either of the phage gill or gVIII coat proteins can be used to generate fusion 
proteins without disrupting the ultimate packaging of the viral particle. Foreign epitopes can 
20 be expressed at the NH2-terminal end of pill and phage bearing such epitopes recovered from 

a large excess of phage lacking this epitope (Ladner et al. PCT publication WO 90/02909; 
Garrard et al, PCT publication WO 92/09690; Marks et al. (1992) 1 Biol Chem. 267:16007- 
16010; Griffiths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624- 
628; and Barbas et al. (1992) PNAS 89:4457-4461). 
25 A common approach uses the maltose receptor of E. coli (the outer membrane protein, 

LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037). 
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce 
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peptides fused into one of the extracellular loops of the protein. These peptides are available 
for binding to ligands, e.g., to antibodies, and can elicit an immune response when the cells 
are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al. (1991) 
Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 37-45), and PAL (Fuchs 
5 et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial surface structures have served 
as vehicles for peptide display. Peptides can be fused to pilin, a protein which polymerizes to 
form the pilus-a conduit for interbacterial exchange of genetic information (Thiry et al. (1989) 
Appl Environ. Microbiol 55, 984-993). Because of its role in interacting with other cells, the 
pilus provides a useful support for the presentation of peptides to the extracellular 

10 environment. Another large surface structure used for peptide display is the bacterial motive 
organ, the flagellum. Fusion of peptides to the subunit protein flagellin offers a dense array of 
many peptide copies on the host cells (Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). 
Surface proteins of other bacterial species have also served as peptide fusion partners. 
Examples include the Staphylococcus protein A and the outer membrane IgA protease of 

15 Neisseria (Hansson et al. (1992) J. Bacteriol 174, 4239-4245 and Klauser et al. (1990) EMBO 
J. 9, 1991-1999). 

In the filamentous phage systems and the LamB system described above, the physical 
link between the peptide and its encoding DNA occurs by the containment of the DNA within 
a particle (cell or phage) that carries the peptide on its surface. Capturing the peptide captures 

20 the particle and the DNA within. An alternative scheme uses the DNA-binding protein Lad 
to form a link between peptide and DNA (Cull et al (1992) PNAS USA 89:1865-1869). This 
system uses a plasmid containing the Lad gene with an oligonucleotide cloning site at its 3- 
end. Under the controlled induction by arabinose, a Lacl-peptide fusion protein is produced. 
This fusion retains the natural ability of Lad to bind to a short DNA sequence known as LacO 

25 operator (LacO). By installing two copies of LacO on the expression plasmid, the Lacl- 
peptide fusion binds tightly to the plasmid that encoded it. Because the plasmids in each cell 
contain only a single oligonucleotide sequence and each cell expresses only a single peptide 

59 



Attorney Docket: PATH03-16 



sequence, the peptides become specifically and stablely associated with the DNA sequence 
that directed its synthesis. The cells of the library are gently lysed and the peptide-DNA 
complexes are exposed to a matrix of immobilized receptor to recover the complexes 
containing active peptides. The associated plasmid DNA is then reintroduced into cells for 
5 amplification and DNA sequencing to determine the identity of the peptide ligands. As a 
demonstration of the practical utility of the method, a large random library of dodecapeptides 
was made and selected on a monoclonal antibody raised against the opioid peptide dynorphin 
B. A cohort of peptides was recovered, all related by a consensus sequence corresponding to a 
six-residue portion of dynorphin B. (Cull et al. (1992) Proc. Natl. Acad. Set U.S.A. 89-1869) 

10 This scheme, sometimes referred to as peptides-on-plasmids, differs in two important 

ways from the phage display methods. First, the peptides are attached to the C-terminus of the 
fusion protein, resulting in the display of the library members as peptides having free carboxy 
termini. Both of the filamentous phage coat proteins, pill and pVIII, are anchored to the 
phage through their C-termini, and the guest peptides are placed into the outward-extending 

15 N-terminal domains. In some designs, the phage-displayed peptides are presented right at the 
amino terminus of the fusion protein. (Cwirla, et al. (1990) Proc. Natl Acad. ScL U.S.A. 87, 
6378-6382) A second difference is the set of biological biases affecting the population of 
peptides actually present in the libraries. The Lad fusion molecules are confined to the 
cytoplasm of the host cells. The phage coat fusions are exposed briefly to the cytoplasm 

20 during translation but are rapidly secreted through the inner membrane into the periplasmic 
compartment, remaining anchored in the membrane by their C-terminal hydrophobic domains, 
with the N-termini, containing the peptides, protruding into the periplasm while awaiting 
assembly into phage particles. The peptides in the Lad and phage libraries may differ 
significantly as a result of their exposure to different proteolytic activities. The phage coat 

25 proteins require transport across the inner membrane and signal peptidase processing as a 
prelude to incorporation into phage. Certain peptides exert a deleterious effect on these 
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processes and are underrepresented in the libraries (Gallop et al. (1994) J. Med. Chem. 
37(9): 1233- 125 1). These particular biases are not a factor in the Lad display system. 

The number of small peptides available in recombinant random libraries is enormous. 

7 9 11 
Libraries of 10 -10 independent clones are routinely prepared. Libraries as large as 10 

5 recombinants have been created, but this size approaches the practical limit for clone libraries. 

This limitation in library size occurs at the step of transforming the DNA containing 

randomized segments into the host bacterial cells. To circumvent this limitation, an in vitro 

system based on the display of nascent peptides in polysome complexes has recently been 

developed. This display library method has the potential of producing libraries 3-6 orders of 

10 magnitude larger than the currently available phage/phagemid or plasmid libraries. 

Furthermore, the construction of the libraries, expression of the peptides, and screening, is 

done in an entirely cell-free format. 

In one application of this method (Gallop et al. (1994) J. Med. Chem. 37(9):1233- 

12 

1251), a molecular DNA library encoding 10 decapeptides was constructed and the library 
15 expressed in an E. coli S30 in vitro coupled transcription/translation system. Conditions were 
chosen to stall the ribosomes on the mRNA, causing the accumulation of a substantial 
proportion of the RNA in polysomes and yielding complexes containing nascent peptides still 
linked to their encoding RNA. The polysomes are sufficiently robust to be affinity purified on 
immobilized receptors in much the same way as the more conventional recombinant peptide 
20 display libraries are screened. RNA from the bound complexes is recovered, converted to 
cDNA, and amplified by PCR to produce a template for the next round of synthesis and 
screening. The polysome display method can be coupled to the phage display system. 
Following several rounds of screening, cDNA from the enriched pool of polysomes was 
cloned into a phagemid vector. This vector serves as both a peptide expression vector, 
25 displaying peptides fused to the coat proteins, and as a DNA sequencing vector for peptide 
identification. By expressing the polysome-derived peptides on phage, one can either 
continue the affinity selection procedure in this format or assay the peptides on individual 

61 



Attorney Docket: PATH03-16 



clones for binding activity in a phage ELISA, or for binding specificity in a completion phage 
ELISA (Barret, et al. (1992) Anal. Biochem 204,357-364). To identify the sequences of the 
active peptides one sequences the DNA produced by the phagemid host. 

5 Secondary Screening of Polypeptides and Analogs 

The high through-put assays described above can be followed by secondary screens in 
order to identify further biological activities which will, e.g., allow one skilled in the art to 
differentiate agonists from antagonists. The type of a secondary screen used will depend on 
the desired activity that needs to be tested. For example, an assay can be developed in which 
10 the ability to inhibit an interaction between a protein of interest and its respective ligand can 
be used to identify antagonists from a group of peptide fragments isolated though one of the 
primary screens described above. 

Therefore, methods for generating fragments and analogs and testing them for activity 
are known in the art. Once the core sequence of interest is identified, it is routine for one 
1 5 skilled in the art to obtain analogs and fragments. 

Peptide Mimetics of £ epidermidis Polypeptides 

The invention also provides for reduction of the protein binding domains of the subject 
S. epidermidis polypeptides to generate mimetics, e.g. peptide or non-peptide agents. The 

20 peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand, e.g., in the 
case of an S. epidermidis polypeptide binding to a naturally occurring ligand. The critical 
residues of a subject S. epidermidis polypeptide which are involved in molecular recognition 
of a polypeptide can be determined and used to generate S. epidermidis -derived 
peptidomimetics which competitively or noncompetitively inhibit binding of the S. 

25 epidermidis polypeptide with an interacting polypeptide (see, for example, European patent 
applications EP-4 12,762 A and EP-B31,080A). 
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For example, scanning mutagenesis can be used to map the amino acid residues of a 
particular S. epidermidis polypeptide involved in binding an interacting polypeptide, 
peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated 
which mimic those residues in binding to an interacting polypeptide, and which therefore can 
5 inhibit binding of an S. epidermidis polypeptide to an interacting polypeptide and thereby 
interfere with the function of S. epidermidis polypeptide. For instance, non-hydrolyzable 
peptide analogs of such residues can be generated using benzodiazepine (e.g., see Freidinger 
et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and Biology, 

10 G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gama lactam 
rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) 
J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function (Proceedings of 
the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), b-turn 

15 dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem 
Soc Perkin Trans 1 : 1231), and b-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res 
Commun 126:419; and et al. (1986) Biochem Biophys Res Commun 134:71). 



Vaccine Formulations for S. epidermidis Nucleic Acids and Polypeptides 
20 This invention also features vaccine compositions for protection against infection by S. 

epidermidis or for treatment of S. epidermidis infection. In one embodiment, the vaccine 
compositions contain one or more immunogenic components such as a surface protein from 5. 
epidermidis , or portion thereof, and a pharmaceutical^ acceptable carrier. Nucleic acids 
within the scope of the invention are exemplified by the nucleic acids of the invention 
25 contained in the Sequence Listing which encode S. epidermidis surface proteins. Any nucleic 
acid encoding an immunogenic & epidermidis protein, or portion thereof, which is capable of 
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expression in a cell, can be used in the present invention. These vaccines have therapeutic 
and prophylactic utilities. 

One aspect of the invention provides a vaccine composition for protection against 
infection by S. epidermidis which contains at least one immunogenic fragment of an S. 
5 epidermidis protein and a pharmaceutical^ acceptable carrier. Preferred fragments include 
peptides of at least about 10 amino acid residues in length, preferably about 10-20 amino acid 
residues in length, and more preferably about 12-16 amino acid residues in length. 

Immunogenic components of the invention can be obtained, for example, by screening 
polypeptides recombinantly produced from the corresponding fragment of the nucleic acid 

10 encoding the full-length S. epidermidis protein. In addition, fragments can be chemically 
synthesized using techniques known in the art such as conventional Merrifield solid phase f- 
Moc or t-Boc chemistry. 

In one embodiment, immunogenic components are identified by the ability of the 
peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for example, 

15 T cell proliferation or cytokine secretion are defined herein as comprising at least one T cell 
epitope. T cell epitopes are believed to be involved in initiation and perpetuation of the 
immune response to the protein allergen which is responsible for the clinical symptoms of 
allergy. These T cell epitopes are thought to trigger early events at the level of the T helper 
cell by binding to an appropriate HLA molecule on the surface of an antigen presenting cell, 

20 thereby stimulating the T cell subpopulation with the relevant T cell receptor for the epitope. 
These events lead to T cell proliferation, lymphokine secretion, local inflammatory reactions, 
recruitment of additional immune cells to the site of antigen/T cell interaction, and activation 
of the B cell cascade, leading to the production of antibodies. A T cell epitope is the basic 
element, or smallest unit of recognition by a T cell receptor, where the epitope comprises 

25 amino acids essential to receptor recognition (e.g., approximately 6 or 7 amino acid residues). 
Amino acid sequences which mimic those of the T cell epitopes are within the scope of this 
invention. 
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Screening immunogenic components can be accomplished using one or more of 
several different assays. For example, in vitro, peptide T cell stimulatory activity is assayed 
by contacting a peptide known or suspected of being immunogenic with an antigen presenting 
cell which presents appropriate MHC molecules in a T cell culture. Presentation of an 
5 immunogenic S. epidermidis peptide in association with appropriate MHC molecules to T 
cells in conjunction with the necessary co-stimulation has the effect of transmitting a signal to 
the T cell that induces the production of increased levels of cytokines, particularly of 
interleukin-2 and interleukin-4. The culture supernatant can be obtained and assayed for 
interleukin-2 or other known cytokines. For example, any one of several conventional assays 
10 for interleukin-2 can be employed, such as the assay described in Proc. Natl Acad. Sci USA, 
86: 1333 (1989) the pertinent portions of which are incorporated herein by reference. A kit 
for an assay for the production of interferon is also available from Genzyme Corporation 
(Cambridge, MA). 

Alternatively, a common assay for T cell proliferation entails measuring tritiated 

1 5 thymidine incorporation. The proliferation of T cells can be measured in vitro by determining 
3 

the amount of H-labeled thymidine incorporated into the replicating DNA of cultured cells. 
Therefore, the rate of DNA synthesis and, in turn, the rate of cell division can be quantified. 

Vaccine compositions of the invention containing immunogenic components (e.g., S. 
epidermidis polypeptide or fragment thereof or nucleic acid encoding an S. epidermidis 

20 polypeptide or fragment thereof) preferably include a pharmaceutically acceptable carrier. 
The term "pharmaceutically acceptable carrier" refers to a carrier that does not cause an 
allergic reaction or other untoward effect in patients to whom it is administered. Suitable 
pharmaceutically acceptable carriers include, for example, one or more of water, saline, 
phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as combinations 

25 thereof. Pharmaceutically acceptable carriers may further comprise minor amounts of 
auxiliary substances such as wetting or emulsifying agents, preservatives or buffers, which 
enhance the shelf life or effectiveness of the antibody. For vaccines of the invention 
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containing S. epidermidis polypeptides, the polypeptide is co-administered with a suitable 
adjuvant. 

It will be apparent to those of skill in the art that the therapeutically effective amount 

of DNA or protein of this invention will depend, inter alia, upon the administration schedule, 
5 the unit dose of antibody administered, whether the protein or DNA is administered in 

combination with other therapeutic agents, the immune status and health of the patient, and 

the therapeutic activity of the particular protein or DNA. 

Vaccine compositions are conventionally administered parenterally, e.g., by injection, 

either subcutaneously or intramuscularly. Methods for intramuscular immunization are 
10 described by Wolff et al. (1990) Science 247: 1465-1468 and by Sedegah et al. (1994) 

Immunology 91 : 9866-9870. Other modes of administration include oral and pulmonary 

formulations, suppositories, and transdermal applications. Oral immunization is preferred 

over parenteral methods for inducing protection against infection by S. epidermidis . Cain et. 

al. (1993) Vaccine 11: 637-642. Oral formulations include such normally employed 
1 5 excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium 

stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. 

The vaccine compositions of the invention can include an adjuvant, including, but not 

limited to aluminum hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP); 

N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 1 1637, referred to as nor-MDP); N- 
20 acetylmuramyl-L-alanyl-D-isogto 

hydroxyphos-phoryloxy)-ethylamine (CGP 19835 A, referred to a MTP-PE); RIBI, which 

contains three components from bacteria; monophosphoryl lipid A; trehalose dimycoloate; 

cell wall skeleton (MPL + TDM + CWS) in a 2% squalene/Tween 80 emulsion; and cholera 

toxin. Others which may be used are non-toxic derivatives of cholera toxin, including its B 
25 subunit, and/or conjugates or genetically engineered fusions of the S. epidermidis polypeptide 

with cholera toxin or its B subunit, procholeragenoid, fungal polysaccharides, including 
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schizophyllan, muramyl dipeptide, muramyl dipeptide derivatives, phorbol esters, labile toxin 
of E. coli, non-S epidermidis bacterial lysates, block polymers or saponins. 

Other suitable delivery methods include biodegradable microcapsules or immuno- 
stimulating complexes (ISCOMs), cochleates, or liposomes, genetically engineered attenuated 
5 live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like particles, e.g., 
bluetongue. The amount of adjuvant employed will depend on the type of adjuvant used. For 
example, when the mucosal adjuvant is cholera toxin, it is suitably used in an amount of 5 mg 
to 50 mg, for example 10 mg to 35 mg. When used in the form of microcapsules, the amount 
used will depend on the amount employed in the matrix of the microcapsule to achieve the 
10 desired dosage. The determination of this amount is within the skill of a person of ordinary 
skill in the art. 

Carrier systems in humans may include enteric release capsules protecting the antigen 
from the acidic environment of the stomach, and including S. epidermidis polypeptide in an 
insoluble form as fusion proteins. Suitable carriers for the vaccines of the invention are 

15 enteric coated capsules and polylactide-glycolide microspheres. Suitable diluents are 0.2 N 
NaHC03 and/or saline. 

Vaccines of the invention can be administered as a primary prophylactic agent in 
adults or in children, as a secondary prevention, after successful eradication of S. epidermidis 
in an infected host, or as a therapeutic agent in the aim to induce an immune response in a 

20 susceptible host to prevent infection by S. epidermidis . The vaccines of the invention are 
administered in amounts readily determined by persons of ordinary skill in the art. Thus, for 
adults a suitable dosage will be in the range of 10 mg to 10 g, preferably 10 mg to 100 mg. A 
suitable dosage for adults will also be in the range of 5 mg to 500 mg. Similar dosage ranges 
will be applicable for children. Those skilled in the art will recognize that the optimal dose 

25 may be more or less depending upon the patient's body weight, disease, the route of 

administration, and other factors. Those skilled in the art will also recognize that appropriate 
dosage levels can be obtained based on results with known oral vaccines such as, for example, 
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a vaccine based on an E. coli lysate (6 mg dose daily up to total of 540 mg) and with an 
enterotoxigenic E. coli purified antigen (4 doses of 1 mg) (Schulman et al., J. Urol 150:917- 
921 (1993); Boedecker et al., American Gastroenterological Assoc. 999:A-222 (1993)). The 
number of doses will depend upon the disease, the formulation, and efficacy data from clinical 
5 trials. Without intending any limitation as to the course of treatment, the treatment can be 
administered over 3 to 8 doses for a primary immunization schedule over 1 month (Boedeker, 
American Gastroenterological Assoc. 888:A-222 (1993)). 

In a preferred embodiment, a vaccine composition of the invention can be based on a 
killed whole E. coli preparation with an immunogenic fragment of an S. epidermidis protein 

10 of the invention expressed on its surface or it can be based on an E. coli lysate, wherein the 
killed E. coli acts as a carrier or an adjuvant. 

It will be apparent to those skilled in the art that some of the vaccine compositions of 
the invention are useful only for preventing S. epidermidis infection, some are useful only for 
treating S. epidermidis infection, and some are useful for both preventing and treating S. 

15 epidermidis infection. In a preferred embodiment, the vaccine composition of the invention 
provides protection against S. epidermidis infection by stimulating humoral and/or cell- 
mediated immunity against £ epidermidis . It should be understood that amelioration of any 
of the symptoms of S. epidermidis infection is a desirable clinical goal, including a lessening 
of the dosage of medication used to treat S. epidermidis -caused disease, or an increase in the 

20 production of antibodies in the serum or mucous of patients. 

Antibodies Reactive With S. epidermidis Polypeptides 

The invention also includes antibodies specifically reactive with the subject S. 
epidermidis polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be 
25 made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by 
Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a hamster 
or rabbit can be immunized with an immunogenic form of the peptide. Techniques for 
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conferring immunogenicity on a protein or peptide include conjugation to carriers or other 
techniques well known in the art. An immunogenic portion of the subject S. epidermidis 
polypeptide can be administered in the presence of adjuvant. The progress of immunization 
can be monitored by detection of antibody titers in plasma or serum. Standard ELIS A or other 
5 immunoassays can be used with the immunogen as antigen to assess the levels of antibodies. 

In a preferred embodiment, the subject antibodies are immunospecific for antigenic 
determinants of the S. epidermidis polypeptides of the invention, e.g. antigenic determinants 
of a polypeptide of the invention contained in the Sequence Listing, or a closely related human 
or non-human mammalian homolog (e.g., 90% homologous, more preferably at least about 

10 95% homologous). In yet a further preferred embodiment of the invention, the anti-& 

epidermidis antibodies do not substantially cross react (i.e., react specifically) with a protein 
which is for example, less than 80% percent homologous to a sequence of the invention 
contained in the Sequence Listing. By "not substantially cross react", it is meant that the 
antibody has a binding affinity for a non-homologous protein which is less than 10 percent, 

15 more preferably less than 5 percent, and even more preferably less than 1 percent, of the 
binding affinity for a protein of the invention contained in the Sequence Listing. In a most 
preferred embodiment, there is no cross-reactivity between bacterial and mammalian antigens. 

The term antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with S. epidermidis polypeptides. Antibodies can be fragmented 

20 using conventional techniques and the fragments screened for utility in the same manner as 
described above for whole antibodies. For example, F(ab')2 fragments can be generated by 
treating antibody with pepsin. The resulting F(ab')2 fragment can be treated to reduce 
disulfide bridges to produce Fab' fragments. The antibody of the invention is further intended 
to include bispecific and chimeric molecules having an anti-5. epidermidis portion. 

25 Both monoclonal and polyclonal antibodies (Ab) directed against S. epidermidis 

polypeptides or S. epidermidis polypeptide variants, and antibody fragments such as Fab' and 
F(ab')2, can be used to block the action of S. epidermidis polypeptide and allow the study of 
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the role of a particular S. epidermidis polypeptide of the invention in aberrant or unwanted 
intracellular signaling, as well as the normal cellular function of the S. epidermidis and by 
microinjection of anti-5. epidermidis polypeptide antibodies of the present invention. 
Antibodies which specifically bind S. epidermidis epitopes can also be used in 
5 immunohistochemical staining of tissue samples in order to evaluate the abundance and 

pattern of expression of S. epidermidis antigens. Anti-& epidermidis polypeptide antibodies 
can be used diagnostically in immuno-precipitation and immuno-blotting to detect and 
evaluate S. epidermidis levels in tissue or bodily fluid as part of a clinical testing procedure. 
Likewise, the ability to monitor S. epidermidis polypeptide levels in an individual can allow 

10 determination of the efficacy of a given treatment regimen for an individual afflicted with 
such a disorder. The level of an S. epidermidis polypeptide can be measured in cells found in 
bodily fluid, such as in urine samples or can be measured in tissue, such as produced by 
gastric biopsy. Diagnostic assays using anti-5. epidermidis antibodies can include, for 
example, immunoassays designed to aid in early diagnosis of S. epidermidis infections. The 

1 5 present invention can also be used as a method of detecting antibodies contained in samples 
from individuals infected by this bacterium using specific S. epidermidis antigens. 

Another application of anti-5. epidermidis polypeptide antibodies of the invention is 
in the immunological screening of cDNA libraries constructed in expression vectors such as 
A,gtl 1, A,gtl8-23, XZAP, and A,ORF8. Messenger libraries of this type, having coding 

20 sequences inserted in the correct reading frame and orientation, can produce fusion proteins. 
For instance, Xgt\ 1 will produce fusion proteins whose amino termini consist of ii- 
galactosidase amino acid sequences and whose carboxy termini consist of a foreign 
polypeptide. Antigenic epitopes of a subject S. epidermidis polypeptide can then be detected 
with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with 

25 anti-5. epidermidis polypeptide antibodies. Phage, scored by this assay, can then be isolated 
from the infected plate. Thus, the presence of S. epidermidis gene homologs can be detected 
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and cloned from other species, and alternate isoforms (including splicing variants) can be 
detected and cloned. 

Kits Containing Nucleic Acids, Polypeptides or Antibodies of the Invention 
5 The nucleic acid, polypeptides and antibodies of the invention can be combined with 

other reagents and articles to form kits. Kits for diagnostic purposes typically comprise the 
nucleic acid, polypeptides or antibodies in vials or other suitable vessels. Kits typically 
comprise other reagents for performing hybridization reactions, polymerase chain reactions 
(PCR), or for reconstitution of lyophilized components, such as aqueous media, salts, buffers, 

10 and the like. Kits may also comprise reagents for sample processing such as detergents, 

chaotropic salts and the like. Kits may also comprise immobilization means such as particles, 
supports, wells, dipsticks and the like. Kits may also comprise labeling means such as dyes, 
developing reagents, radioisotopes, fluorescent agents, luminescent or chemiluminescent 
agents, enzymes, intercalating agents and the like. With the nucleic acid and amino acid 

15 sequence information provided herein, individuals skilled in art can readily assemble kits to 
serve their particular purpose. Kits further can include instructions for use. 

Bio chip Technology 

The nucleic acid sequence of the present invention may be used to detect S. 

20 epidermidis or other species of Staphylococcus acid sequence using bio chip technology. Bio 
chips containing arrays of nucleic acid sequence can also be used to measure expression of 
genes of S. epidermidis or other species of Staphylococcus. For example, to diagnose a 
patient with a S. epidermidis or other Staphylococcus infection, a sample from a human or 
animal can be used as a probe on a bio chip containing an array of nucleic acid sequence from 

25 the present invention. In addition, a sample from a disease state can be compared to a sample 
from a non-disease state which would help identify a gene that is up-regulated or expressed in 
the disease state. This would provide valuable insight as to the mechanism by which the 
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disease manifests. Changes in gene expression can also be used to identify critical pathways 
involved in drug transport or metabolism, and may enable the identification of novel targets 
involved in virulence or host cell interactions involved in maintenance of an infection. 
Procedures using such techniques have been described by Brown et aL, 1995, Science 270: 
5 467-470. 

Bio chips can also be used to monitor the genetic changes of potential therapeutic 
compounds including, deletions, insertions or mismatches. Once the therapeutic is added to 
the patient, changes to the genetic sequence can be evaluated for its efficacy. In addition, the 
nucleic acid sequence of the present invention can be used to determine essential genes in cell 

10 cycling. As described in Iyer et aL, 1999 {Science, 283:83-87 ) genes essential in the cell 
cycle can be identified using bio chips. Furthermore, the present invention provides nucleic 
acid sequence which can be used with bio chip technology to understand regulatory networks 
in bacteria, measure the response to environmental signals or drugs as in drug screening, and 
study virulence induction. (Mons et aL, 1998, Nature Biotechnology, 16: 45-48. Patents 

1 5 teaching this technology include U.S. Patents 5445934, 5744305, and 5800992. 

Drug Screening Assays Using S. epidermidis Polypeptides 

By making available purified and recombinant S. epidermidis polypeptides, the 
present invention provides assays which can be used to screen for drugs which are either 

20 agonists or antagonists of the normal cellular function, in this case, of the subject S. 
epidermidis polypeptides, or of their role in intracellular signaling. Such inhibitors or 
potentiators may be useful as new therapeutic agents to combat S. epidermidis infections in 
humans. A variety of assay formats will suffice and, in light of the present inventions, will be 
comprehended by the person skilled in the art. 

25 In many drug screening programs which test libraries of compounds and natural 

extracts, high throughput assays are desirable in order to maximize the number of compounds 
surveyed in a given period of time. Assays which are performed in cell-free systems, such as 
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may be derived with purified or semi-purified proteins, are often preferred as "primary" 
screens in that they can be generated to permit rapid development and relatively easy detection 
of an alteration in a molecular target which is mediated by a test compound. Moreover, the 
effects of cellular toxicity and/or bioavailability of the test compound can be generally ignored 
5 in the in vitro system, the assay instead being focused primarily on the effect of the drug on 
the molecular target as may be manifest in an alteration of binding affinity with other proteins 
or change in enzymatic properties of the molecular target. Accordingly, in an exemplary 
screening assay of the present invention, the compound of interest is contacted with an 
isolated and purified S. epidermidis polypeptide. 

10 Screening assays can be constructed in vitro with a purified S. epidermidis 

polypeptide or fragment thereof, such as an S. epidermidis polypeptide having enzymatic 
activity, such that the activity of the polypeptide produces a detectable reaction product. The 
efficacy of the compound can be assessed by generating dose response curves from data 
obtained using various concentrations of the test compound. Moreover, a control assay can 

1 5 also be performed to provide a baseline for comparison. Suitable products include those with 
distinctive absorption, fluorescence, or chemi-luminescence properties, for example, because 
detection may be easily automated. A variety of synthetic or naturally occurring compounds 
can be tested in the assay to identify those which inhibit or potentiate the activity of the S. 
epidermidis polypeptide. Some of these active compounds may directly, or with chemical 

20 alterations to promote membrane permeability or solubility, also inhibit or potentiate the same 
activity (e.g., enzymatic activity) in whole, live S. epidermidis cells. 

Overexpression Assays 

Overexpression assays are based on the premise that overproduction of a protein 
25 would lead to a higher level of resistance to compounds that selectively interfere with the 
function of that protein. Overexpression assays may be used to identify compounds that 
interfere with the function of virtually any type of protein, including without limitation 
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enzymes, receptors, DNA- or RNA-binding proteins, or any proteins that are directly or 
indirectly involved in regulating cell growth. 

Typically, two bacterial strains are constructed. One contains a single copy of the gene 
of interest, and a second contains several copies of the same gene. Identification of useful 
5 inhibitory compounds of this type of assay is based on a comparison of the activity of a test 
compound in inhibiting growth and/or viability of the two strains. The method involves 
constructing a nucleic acid vector that directs high level expression of a particular target 
nucleic acid. The vectors are then transformed into host cells in single or multiple copies to 
produce strains that express low to moderate and high levels of protein encoding by the target 

10 sequence (strain A and B, respectively). Nucleic acid comprising sequences encoding the 
target gene can, of course, be directly integrated into the host cell. 

Large numbers of compounds (or crude substances which may contain active 
compounds) are screened for their effect on the growth of the two strains. Agents which 
interfere with an unrelated target equally inhibit the growth of both strains. Agents which 

1 5 interfere with the function of the target at high concentration should inhibit the growth of both 
strains. It should be possible, however, to titrate out the inhibitory effect of the compound in 
the overexpressing strain. That is, if the compound is affecting the particular target that is 
being tested, it should be possible to inhibit the growth of strain A at a concentration of the 
compound that allows strain B to grow. 

20 Alternatively, a bacterial strain is constructed that contains the gene of interest under 

the control of an inducible promoter. Identification of useful inhibitory agents using this type 
of assay is based on a comparison of the activity of a test compound in inhibiting growth 
and/or viability of this strain under both inducing and non-inducing conditions. The method 
involves constructing a nucleic acid vector that directs high-level expression of a particular 

25 target nucleic acid. The vector is then transformed into host cells that are grown under both 
non-inducing and inducing conditions (conditions A and B, respectively). 
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Large numbers of compounds (or crude substances which may contain active 
compounds) are screened for their effect on growth under these two conditions. Agents that 
interfere with the function of the target should inhibit growth under both conditions. It should 
be possible, however, to titrate out the inhibitory effect of the compound in the overexpressing 
5 strain. That is, if the compound is affecting the particular target that is being tested, it should 
be possible to inhibit growth under condition A at a concentration that allows the strain to 
grow under condition B. 

Ligand-binding Assays 

10 Many of the targets according to the invention have functions that have not yet been 

identified. Ligand-binding assays are useful to identify inhibitor compounds that interfere 
with the function of a particular target, even when that function is unknown. These assays are 
designed to detect binding of test compounds to particular targets. The detection may involve 
direct measurement of binding. Alternatively, indirect indications of binding may involve 

1 5 stabilization of protein structure or disruption of a biological function. Non-limiting examples 
of useful ligand-binding assays are detailed below. 

A useful method for the detection and isolation of binding proteins is the Biomolecular 
Interaction Assay (BIAcore) system developed by Pharmacia Biosensor and described in the 
manufacturer's protocol (LKB Pharmacia, Sweden). The BIAcore system uses an affinity 

20 purified anti-GST antibody to immobilize GST-fusion proteins onto a sensor chip. The sensor 
utilizes surface plasmon resonance which is an optical phenomenon that detects changes in 
refractive indices. In accordance with the practice of the invention, a protein of interest is 
coated onto a chip and test compounds are passed over the chip. Binding is detected by a 
change in the refractive index (surface plasmon resonance). 

25 A different type of ligand-binding assay involves scintillation proximity assays (SPA, 

described in U.S. Patent No. 4,568,649). 
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Another type of ligand binding assay, also undergoing development, is based on the 
fact that proteins containing mitochondrial targeting signals are imported into isolated 
mitochondria in vitro (Hurt et a/., 1985, Embo J. 4:2061-2068; Eilers and Schatz, Nature, 
1986, 322:228-231). In a mitochondrial import assay, expression vectors are constructed in 
5 which nucleic acids encoding particular target proteins are inserted downstream of sequences 
encoding mitochondrial import signals. The chimeric proteins are synthesized and tested for 
their ability to be imported into isolated mitochondria in the absence and presence of test 
compounds. A test compound that binds to the target protein should inhibit its uptake into 
isolated mitochondria in vitro. 

10 Another ligand-binding assay is the yeast two-hybrid system (Fields and Song, 1989, 

Nature 340:245-246). The yeast two-hybrid system takes advantage of the properties of the 
GAL4 protein of the yeast Saccharomyces cerevisiae. The GAL4 protein is a transcriptional 
activator required for the expression of genes encoding enzymes of galactose utilization. This 
protein consists of two separable and functionally essential domains: an N-terminal domain 

1 5 which binds to specific DNA sequences (UAS G ); and a C-terminal domain containing acidic 
regions, which is necessary to activate transcription. The native GAL4 protein, containing 
both domains, is a potent activator of transcription when yeast are grown on galactose media. 
The N-terminal domain binds to DNA in a sequence-specific manner but is unable to activate 
transcription. The C-terminal domain contains the activating regions but cannot activate 

20 transcription because it fails to be localized to UASq. In the two-hybrid system, a system of 
two hybrid proteins containing parts of GAL4: (1) a GAL4 DNA-binding domain fused to a 
protein f X f and (2) a GAL4 activation region fused to a protein f Y\ If X and Y can form a 
protein-protein complex and reconstitute proximity of the GAL4 domains, transcription of a 
gene regulated by UASq occurs. Creation of two hybrid proteins, each containing one of the 

25 interacting proteins X and Y, allows the activation region of UASg to be brought to its normal 
site of action. 
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The binding assay described in Fodor et aL, 1991, Science 25 1 : 767-773, which 
involves testing the binding affinity of test compounds for a plurality of defined polymers 
synthesized on a solid substrate, may also be useful. 

Compounds which bind to the polypeptides of the invention are potentially useful as 
5 antibacterial agents for use in therapeutic compositions. 

Pharmaceutical formulations suitable for antibacterial therapy comprise the 
antibacterial agent in conjunction with one or more biologically acceptable carriers. Suitable 
biologically acceptable carriers include, but are not limited to, phosphate-buffered saline, 
saline, deionized water, or the like. Preferred biologically acceptable carriers are 
1 0 physiologically or pharmaceutical^ acceptable carriers. 

The antibacterial compositions include an antibacterial effective amount of active 
agent. Antibacterial effective amounts are those quantities of the antibacterial agents of the 
present invention that afford prophylactic protection against bacterial infections or which 
result in amelioration or cure of an existing bacterial infection. This antibacterial effective 
15 amount will depend upon the agent, the location and nature of the infection, and the particular 
host. The amount can be determined by experimentation known in the art, such as by 
establishing a matrix of dosages and frequencies and comparing a group of experimental units 
or subjects to each point in the matrix. 

The antibacterial active agents or compositions can be formed into dosage unit forms, 
20 such as for example, creams, ointments, lotions, powders, liquids, tablets, capsules, 

suppositories, sprays, aerosols or the like. If the antibacterial composition is formulated into a 
dosage unit form, the dosage unit form may contain an antibacterial effective amount of active 
agent. Alternatively, the dosage unit form may include less than such an amount if multiple 
dosage unit forms or multiple dosages are to be used to administer a total dosage of the active 
25 agent. Dosage unit forms can include, in addition, one or more excipient(s), diluent(s), 
disintegrant(s), lubricant(s), plasticizer(s), colorant(s), dosage vehicle(s), absorption 
enhancer(s), stabilizer(s), bactericide(s), or the like. 
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For general information concerning formulations, see, e.g., Gilman et al. (eds.), 1990, 
Goodman and Gilman 's: The Pharmacological Basis of Therapeutics, 8th ed., Pergamon 
Press; and Remington's Pharmaceutical Sciences, 17th ed., 1990, Mack Publishing Co., 
Easton, PA; Avis et al. (eds.), 1993, Pharmaceutical Dosage Forms: Parenteral Medications, 
5 Dekker, New York; Lieberman et al (eds.), 1990, Pharmaceutical Dosage Forms: Disperse 
Systems, Dekker, New York. 

The antibacterial agents and compositions of the present invention are useful for 
preventing or treating S. epidermidis infections. Infection prevention methods incorporate a 
prophylactically effective amount of an antibacterial agent or composition. A prophylactically 
10 effective amount is an amount effective to prevent S. epidermidis infection and will depend 
upon the specific bacterial strain, the agent, and the host. These amounts can be determined 
experimentally by methods known in the art and as described above. 

S. epidermidis infection treatment methods incorporate a therapeutically effective 
amount of an antibacterial agent or composition. A therapeutically effective amount is an 
15 amount sufficient to ameliorate or eliminate the infection. The prophylactically and/or 

therapeutically effective amounts can be administered in one administration or over repeated 
administrations. Therapeutic administration can be followed by prophylactic administration, 
once the initial bacterial infection has been resolved. 

The antibacterial agents and compositions can be administered topically or 
20 systemically. Topical application is typically achieved by administration of creams, 

ointments, lotions, or sprays as described above. Systemic administration includes both oral 
and parental routes. Parental routes include, without limitation, subcutaneous, intramuscular, 
intraperitoneal, intravenous, transdermal, inhalation and intranasal administration. 



25 
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EXEMPLIFICATION 

Cloning and Sequencing S. epidermidis Genomic Sequence 

This invention provides nucleotide sequences of the genome of S. epidermidis which 
5 thus comprises a DNA sequence library of S. epidermidis genomic DNA. The invention also 
provides nucleotide sequences of two naturally occurring plasmids in S. epidermidis. The 
detailed description that follows provides nucleotide sequences of S. epidermidis, and also 
describes how the sequences were obtained and how ORFs (Open Reading Frames) and 
protein-coding sequences can be identified. Also described are methods of using the disclosed 

10 S. epidermidis sequences in methods including diagnostic and therapeutic applications. 
Furthermore, the library can be used as a database for identification and comparison of 
medically important sequences in this and other strains of S. epidermidis as well as other 
species of Staphylococcus . 

Chromosomal DNA from strain 18972 of S. epidermidis, was isolated using a protocol 

15 described by Storrs, et al.(J. Bacteriol 173: 4347-4352 (1991). The only exception to this 
protocol was that lysostaphin (120 U/ml) was used instead of lysozyme. Two endogenous 
plasmids of approximately 39 Kb and 2.9 Kb in size were identified upon visualization of the 
S. epidermidis genomic DNA on a 0.5% agarose gel. The first library constructed contained 
fragments from the S. epidermidis genome as well as from the endogenous plasmid. A second 

20 library was later constructed with genomic DNA, from which the plasmid DNA was removed 
by CsCl centrifugation. The genomic DNA prep involved a lysozyme: lysostaphin digestion, 
sodium dodecyl sulfate lysis, Proteinase K and RNase treatment, phenol: chloroform 
extraction, and sodium acetate precipitation, followed by the CsCl gradient to remove the 
plasmid. 

25 In the construction of both libraries, genomic S. epidermidis DNA was 

hydrodynamically sheared in an HPLC and then separated on a standard 1% agarose gel. A 
fraction corresponding to 2000-3000 bp in length was excised from the gel and purifed by the 
GeneClean procedure (Bio 101, Inc.). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. The 

30 healed DNA was then ligated to unique BstXI-linker adapters (5'-GTCTTCACCACGGGG-3' 
and 5 '-GTGGTGAAGAC-3 ' in 100-1000 fold molar excess). These linkers are 
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complimentary to the BstXI-cut pGTC vector, while the overhang is not self-complimentary. 
Therefore, the linkers will not concatermerize nor will the cut- vector religate itself easily. The 
linker-adapted inserts were separated from the unincorporated linkers on a 1% agarose gel and 
purified using GeneClean. The linker-adapted inserts were then ligated to Zfo/XI-cut vector to 
5 construct a "shotgun" sublclone libraries. 

Only major modifications to the protocols are highlighted. Briefly, the library was 
then transformed into DH5a competent cells (Gibco/BRL, DH5a transformation protocol). It 
was assessed by plating onto antibiotic plates containing ampicillin and IPTG/Xgal. The 
plates were incubated overnight at 37°C. Transformants were then used for plating of clones 

10 and picking for sequencing. The cultures were grown overnight at 3 7°C. DN A was purified 
using a silica bead DNA preparation (Engelstein, 1996) method. In this manner, 25 jxg of 
DNA was obtained per clone. 

These purified DNA samples were then sequenced using primarily ABI dye-terminator 
chemistry. All subsequent steps were based on sequencing by ABB 77 automated DNA 

1 5 sequencing methods. The ABI dye terminator sequence reads were run on ABB 77 machines 
and the data was transferred to UNIX machines following lane tracking of the gels. Base 
calls and quality scores were determined using the program PHRED (Ewing et al., 1998, 
Genome Res. 8: 175-185; Ewing and Green, 1998, Genome Res. 8: 685-734). Reads were 
assembled using PHRAP (P. Green, Abstracts of DOE Human Genome Program Contractor- 

20 Grantee Workshop V, Jan. 1996, p. 157) with default program parameters and quality scores. 

Finishing followed the initial assembly. Missing mates (sequences from clones that 
only gave reads from one end of the Staphylococcus DNA inserted in the plasmid) were 
identified and sequenced with ABI technology to allow the identification of additional 
overlapping contigs. 

25 End-sequencing of randomly picked genomic lambda was also performed. Sequencing 

of both sides was done for all lambda sequences. The lambda library backbone helped to 
verify the integrity of the assembly and allowed closure of some of the physical gaps. Primers 
for walking off the ends of contigs would be selected using pick_primer ( a GTC program) 
near the ends of the clones to facilitate gap closure. These walks can be sequenced using the 

30 selected clones and primers. These data are then reassembled with PHRAP. Additional 
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sequencing using PCR-generated templates and screened and/or unscreened lambda templates 
can be done in addition. 

Additional templates for the physical gaps were obtained through PCR using primers 
designed from the ends of the contigs. These templates were then used in sequencing 
5 reactions to close the gaps. 

Contigs were ordered by aligning identified S. epidermidis genes to the published physical 
maps. Order was confirmed by PCR. The final chromosomal assembly included 23 ordered 
contigs and the two plasmids each contained lcontig each. 

To identify S. epidermidis polypeptides the complete genomic sequence of S. 

10 epidermidis were analyzed essentially as follows: First, all possible stop-to- stop open reading 
frames (ORFs) greater than 180 nucleotides in all six reading frames were translated into 
amino acid sequences. Second, the identified ORFs were analyzed for homology to known 
(archeabacter, prokaryotic and eukaryotic) protein sequences. Third, the coding potential of 
non-homologous sequences were evaluated with the program GENEMARKTM (Borodovsky 

15 and Mclninch, 1993, Comp. Chem. 17:123). 

Identification, Cloning and Expression of S. epidermidis Nucleic Acids 

Expression and purification of the S. epidermidis polypeptides of the invention can be 
performed essentially as outlined below. 

20 To facilitate the cloning, expression and purification of membrane and secreted 

proteins from S. epidermidis , a gene expression system, such as the pET System (Novagen), 
for cloning and expression of recombinant proteins in E. coli, is selected. Also, a DNA 
sequence encoding a peptide tag, the His-Tag, is fused to the 3' end of DNA sequences of 
interest in order to facilitate purification of the recombinant protein products. The 3' end is 

25 selected for fusion in order to avoid alteration of any 5' terminal signal sequence. 
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PCR Amplification and Cloning of Nucleic Acids Containing ORFs Encoding Enzymes 

Nucleic acids chosen (for example, from the nucleic acids set forth in SEQ ID NO: 1 - 
SEQ ID NO: 3772 for cloning from the 18972 strain of S. epidermidis and plasmids are 
prepared for amplification cloning by polymerase chain reaction (PCR). Synthetic 
5 oligonucleotide primers specific for the S 1 and 3 7 ends of open reading frames (ORFs) are 
designed and purchased from GibcoBRL Life Technologies (Gaithersburg, MD, USA). All 
forward primers (specific for the 5 / end of the sequence) are designed to include an Ncol 
cloning site at the extreme 5 7 terminus. These primers are designed to permit initiation of 
protein translation at a methionine residue followed by a valine residue and the coding 

10 sequence for the remainder of the native 5. epidermidis DNA sequence. All reverse primers 
(specific for the 3 f end of any S. epidermidis ORF) include a EcoRI site at the extreme S 1 
terminus to permit cloning of each S. epidermidis sequence into the reading frame of the 
pET-28b. The pET-28b vector provides sequence encoding an additional 20 carboxy-terminal 
amino acids including six histidine residues (at the extreme C-terminus), which comprise the 

15 His-Tag. 

Genomic DNA or plasmid DNA prepared from the 1 8972 strain of S. epidermidis is 
used as the source of template DNA for PCR amplification reactions (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To amplify a 
DNA sequence containing an S. epidermidis ORF, genomic DNA (50 nanograms) is 
20 introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic 

oligonucleotide primers (forward and reverse primers) complementary to and flanking a 
defined S. epidermidis ORF, 0.2 mM of each deoxynucleotide triphosphate; dATP, dGTP, 
dCTP, dTTP and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular 
Systems, Inc., Branchburg, NJ, USA) in a final volume of 100 microliters. 
25 Upon completion of thermal cycling reactions, each sample of amplified DNA is 

washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, 
MD, USA). All amplified DNA samples are subjected to digestion with the restriction 
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endonucleases, e.g., Ncol and EcoRI (New England BioLabs, Beverly, MA, USA)(Current 
Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). 
DNA samples are then subjected to electrophoresis on 1 .0 % NuSeive (FMC BioProducts, 
Rockland, ME USA) agarose gels. DNA is visualized by exposure to ethidium bromide and 
5 long wave uv irradiation. DNA contained in slices isolated from the agarose gel is purified 
using the Bio 101 GeneClean Kit protocol (Bio 101 Vista, CA, USA). 

Cloning of S. epidermidis Nucleic Acids Into an Expression Vector 

The pET-28b vector is prepared for cloning by digestion with restriction 
10 endonucleases, e.g., Ncol and EcoRI (Current Protocols in Molecular Biology, John Wiley 

and Sons, Inc., F. Ausubel et al., eds., 1994). The pET-28a vector, which encodes a His-Tag 

that can be fused to the 5 7 end of an inserted gene, is prepared by digestion with appropriate 

restriction endonucleases. 

Following digestion, DNA inserts are cloned (Current Protocols in Molecular Biology, 
15 John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously digested pET-28b 

expression vector. Products of the ligation reaction are then used to transform the BL21 strain 

of E. coli (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et 

al., eds., 1994) as described below. 

20 Transformation Of Competent Bacteria With Recombinant Plasmids 

Competent bacteria, E coli strain BL21 or E. coli strain BL21(DE3), are transformed 
with recombinant pET expression plasmids carrying the cloned S. epidermidis sequences 
according to standard methods (Current Protocols in Molecular, John Wiley and Sons, Inc., F. 
Ausubel et al., eds., 1994). Briefly, 1 microliter of ligation reaction is mixed with 50 

25 microliters of electrocompetent cells and subjected to a high voltage pulse, after which, 

samples are incubated in 0.45 milliliters SOC medium (0.5% yeast extract, 2.0 % tryptone, 10 
mM NaCl, 2.5 mM KC1, 10 mM MgC12, 10 mM MgS04 and 20, mM glucose) at 37^C with 



83 



Attorney Docket: PATH03-16 



shaking for 1 hour. Samples are then spread on LB agar plates containing 25 microgram/ml 
kanamycin sulfate for growth overnight. Transformed colonies of BL21 are then picked and 
analyzed to evaluate cloned inserts as described below. 

5 Identification Of Recombinant Expression Vectors With S. epidermidis Nucleic Acids 

Individual BL21 clones transformed with recombinant pET-28b S. epidermidis ORFs 
are analyzed by PCR amplification of the cloned inserts using the same forward and reverse 
primers, specific for each S. epidermidis sequence, that were used in the original PCR 
amplification cloning reactions. Successful amplification verifies the integration of the S. 
10 epidermidis sequences in the expression vector (Current Protocols in Molecular Biology, 
John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). 

Isolation and Preparation of Nucleic Acids From Transform ants 

Individual clones of recombinant pET-28b vectors carrying properly cloned S. 
15 epidermidis ORFs are picked and incubated in 5 mis of LB broth plus 25 microgram/ml 
kanamycin sulfate overnight. The following day plasmid DNA is isolated and purified using 
the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA). 

Expression Of Recombinant S. epidermidis Sequences In E. coli 

20 The pET vector can be propagated in any E. coli K-l 2 strain e.g. HMS 1 74, HB 1 01 , 

JM109, DH5, etc. for the purpose of cloning or plasmid preparation. Hosts for expression 
include E. coli strains containing a chromosomal copy of the gene for T7 RNA polymerase. 
These hosts are lysogens of bacteriophage DE3, a lambda derivative that carries the lad gene, 
the lacUVS promoter and the gene for T7 RNA polymerase. T7 RNA polymerase is induced 

25 by addition of isopropyl-B-D-thiogalactoside (IPTG), and the T7 RNA polymerase transcribes 
any target plasmid, such as pET-28b, carrying its gene of interest. Strains used include: 
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BL21(DE3) (Studier, F.W., Rosenberg, A.H., Dunn, J.J., and Dubendorff, J.W. (1990) Meth. 

Enzymol. 185,60-89). 

To express recombinant S. epidermidis sequences, 50 nanograms of plasmid DNA 

isolated as described above is used to transform competent BL21(DE3) bacteria as described 

5 above (provided by Novagen as part of the pET expression system kit). The lacZ gene (beta- 

galactosidase) is expressed in the pET-System as described for the S. epidermidis 

recombinant constructions. Transformed cells are cultured in SOC medium for 1 hour, and 

the culture is then plated on LB plates containing 25 micrograms/ml kanamycin sulfate. The 

following day, bacterial colonies are pooled and grown in LB medium containing kanamycin 

10 sulfate (25 micrograms/ml) to an optical density at 600 nM of 0.5 to 1 .0 O.D. units, at which 

point, 1 millimolar IPTG was added to the culture for 3 hours to induce gene expression of the 

S. epidermidis recombinant DNA constructions . 

After induction of gene expression with IPTG, bacteria are pelleted by centrifugation 

o 

in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4 C. Pellets are resuspended in 

15 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 mM EDTA (STE buffer). 

o 

Cells are then centrifuged at 2000 x g for 20 min at 4 C. Wet pellets are weighed and frozen 
o 

at -80 C until ready for protein purification. 

A variety of methodologies known in the art can be utilized to purify the isolated 
proteins. (Current Protocols in Protein Science, John Wiley and Sons, Inc., J. E. Coligan et al., 

20 eds., 1995). For example, the frozen cells may be thawed, resupended in buffer and ruptured 
by several passages through a small volume microfluidizer (Model M-l 10S, Microfluidics 
International Corporation, Newton, MA). The resultant homogenate may be centrifuged to 
yield a clear supernatant (crude extract) and following filtration the crude extract may be 
fractionated over columns. Fractions may be monitored by absorbance at OD280 nm - 

25 peak fractions may analyzed by SDS-PAGE 

The concentrations of purified protein preparations may be quantified 
spectrophotometrically using absorbance coefficients calculated from amino acid content 
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(Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations are also measured 
by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, and Lowry, O.H., 
Rosebrough, N., Farr, A.L. & Randall, R.J. (1951) J. Biol. Chem. 193, pages 265-275, using 
bovine serum albumin as a standard. 

SDS-polyacrylamide gels of various concentrations may be purchased from BioRad 
(Hercules, CA, USA), and stained with Coomassie blue. Molecular weight markers may 
include rabbit skeletal muscle myosin (200 kDa), E. coli (-galactosidase (116 kDa), rabbit 
muscle phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), 
bovine carbonic anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), egg white 
lysozyme (14.4 kDa) and bovine aprotinin (6.5 kDa). 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments and methods 
described herein. The specific embodiments described herein are offered by way of example 
only, and the invention is to limited only by the terms of the appended claims, along with the 
full scope of equivalents to which such claims are entitled. 
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Description 

gp: [GI:g2735511] [LN: SCU96108] [AC:U96108] [PN:YwpF homolog] [OR : Staphylococcus 
carnosus] [DB : genpept -bct2 ] [DE : Staphylococcus carnosus (3R) -hydroxymyristoyl acyl 
carrier proteindehydrase homolog (fabZ) gene, partial cds, YwpF homolog, single -strand 
binding protein homolog (ssb) , SceD precursor (sceD),SceA precursor (sceA) and SceE 
precursor (sceE) genes, completecds, and TenA homolog (tenA) gene, partial cds.] 

[NT:Orf2; similar to B. subtilis YwpF protein encoded] [LE:219] [RE:668] 

[DI : complement] 
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Description 

gp: [GI:g2735513] [LN: SCU96108] [AC:U96108] [PN:SceD precursor] [GN: sceD] 
[OR: Staphylococcus carnosus] [DB : genpept -bct2] [DE : Staphylococcus carnosus 
(3R) -hydroxymyristoyl acyl carrier proteindehydrase homolog (fabZ) gene, partial cds, 
YwpF homolog, single-strand binding protein homolog (ssb), SceD precursor (sceD), SceA 
precursor (sceA) and SceE precursor (sceE) genes, completecds, and TenA homolog (tenA) 
gene, partial cds.] [NT: secreted protein] [LE:1825] [RE:2523] [DI:direct] 
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Description 

gp: [GI:g642965] [LN : ABCARRA] [AC:X70360] [GN : carR] [OR : Azospirillum brasilense] 
[DB:genpept-bctl] [DE : A. brasilense carR gene.] [NT : ORF2 ] [LE:59] [RE:580] [DI:direct] 
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Description 

pir : [LN:D71245] [AC:D71245 ] [PN : hypothetical protein PH0221] [GN:PH0221 ] 
[ORrPyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030234 : g3256608] [LN : AP000001] 
[AC:AP000001 : AB009465 : AB009464 :AB009466 : AB00946 7 : AB009468 :AB00946 9] [PN:2 35aa long 

hypothetical protein] [GN:PH0221] [ORrPyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB :genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 1-287000 

nt. position (1/7).] [LE:194212] [RE:194919] [DI : complement] 
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Description 

gp: [GI:g2735516] [LN : SCU96108 ] [AC:U96108] [PNiTenA homolog] [GNrtenA] 
[OR: Staphylococcus carnosus] [DB : genpept-bct2] [DE : Staphylococcus carnosus 
(3R) -hydroxymyristoyl acyl carrier proteindehydrase homolog (fabZ) gene, partial cds, 
YwpF homolog, single- strand binding protein homolog (ssb) , SceD precursor (sceD),SceA 
precursor (sceA) and SceE precursor (sceE) genes, completecds, and TenA homolog (tenA) 
gene, partial cds . ] [NT: similar to TenA of B. subtilis, encoded by GenBank] [LE:4569] 
[RE : >5086] [DI : direct] 
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Description 

sp : [LN : DDL_BACSU] [AC:P96612] [GN : DDLA : DDL] [OR:BACILLUS SUBTILIS] [EC:6.3.2.4] 
[DE : D-ALANINE- -D-ALANINE LIGASE, (D-ALANYLALANINE SYNTHETASE)] [SP:P96612] 
[DB:swissprot] >pir: [LN:D69613] [AC:D69613 ] [PN : D-alanyl-D- alanine ligase A ddlA] 
[GN: ddlA ] [CL:D-alanine--D-alanine ligase] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl020046:gl881266] [LN : AB001488] [AC : AB001488] [ PN : PROBABLE 

D-ALANINE- -D-ALANINE LIGASE A] [GN : ddlA] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain:168) DNA] [DB : genpept-bctl] [EC:6.3.2.4] [DE:Bacillus subtilis genome sequence, 

148 kb sequence of the regionbetween 35 and 47 degree.] [LE: 41311] [RE: 42375] 
[DI: direct] >gp: [GI :ell82422 :g2632756] [LN : BSUB0003 ] [AC : Z99106 : AL009126] 
[PN:D-alanyl-D-alanine ligase A] [GN: ddlA] [FN:peptidoglycan biosynthesis] [OR: Bacillus 

subtilis] [DB: genpept-bctl] [EC:6.3.2.4] [DE:Bacillus subtilis complete genome (section 

3 of 21): from 402751 to611850.] [SP:P96612] [LE:105055] [RE:106119] [DIrdirect] 
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Description 

gp: [GI:dl037675:g4126674] [LN : AB016431] [AC : AB016431] [OR : Staphylococcus aureus] 
[SR: Staphylococcus aureus (strain: 912) DNA] [DB : genpept-bctl] [DE : Staphylococcus 
aureus, zinc responsible operon czr genes, completeand partial cds.] [NT : Hypothetical 
protein] [LE:2175] [RE: 2813] [DIrdirect] 



88 



ORF Name 



^17503000979 12 tl 1 



NT ID 



[13 



NT AA 
AAID — t _ — . , score 
LengthLength 



j [3785 |pn [ 



7T 



probability 
1|0 . 0084 



Description 

pir : [LN:A44803] [AC:A44803 ] [PNrpGl protein] [OR:Homo sapiens] [SR:, man] [DB:pir2] 
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Description 

pir : [LN:A70842] [AC:A70842 ] [PN:probable amiB protein] [GN:amiB ] [OR : Mycobacterium 
tuberculosis] [DB:pir2] >gp: [GI : el251137 :g2894215] [LN:MTV016] [AC : AL021841 : AL123456] 
[PN:amiB] [GNiamiB] [OR : Mycobacterium tuberculosis] [DB : genpept-bctl] [DE : Mycobacterium 
tuberculosis H37Rv complete genome; segment 143/162.] [NT:Rv3306c, (MTV016 . 05c) , len: 
394. amiB, Probable] [LE:3363] [RE:4547] [DI : complement] >gp : [GI : el251137 :g2894215] 
[LN:MTV016] [AC : AL021841 : AL123456] [PN:amiB] [GN:amiB] [OR : Mycobacterium tuberculosis] 
[DB:genpept] [DE : Mycobacterium tuberculosis H37Rv complete genome; segment 143/162.] 
[NT:Rv3306c, (MTV016 . 05c) , len: 394. amiB, Probable] [LE:3363] [RE:4547] 
[DI : complement] 



ORF Name 



NT AA 
NTID AAID — — _ score 
Length Length 



probability 



IAI7503000979 13790952 c2 422 
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Description 

pir : [LN:B69619] [AC:B69619 ] [PN : phosphodeoxyribomutase drm] [GN:drm ] 
[CL:phosphopentomutase] [ORrBacillus subtilis] [DB:pir2] >gp: [GI : dl 01333 0 :gl303995] 
[LN:BACJH642] [AC : D84432 :D82370] [PN: YqkN] [OR:Bacillus subtilis] [SRrBacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 271220] [RE: 272404] [DI:direct] 
>gp: [GI:ell85619:g2634785] [LN : BSUB0013] [AC : Z99116 : AL009126] 

[PN: phosphodeoxyribomutase] [GN:drm] [FN: conversion of ribose-l-P/deoxyribose- 1-P to] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [EC: 5. 4. 2. 7] [DE: Bacillus subtilis complete 
genome (section 13 of 21): from 2395261to 2613730.] [NT : alternate gene name: yqkN] 
[SP:P46353] [LE:51215] [RE:52399] [DI : complement ] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000979 1415877 c2 413 



][ 



[3788 



1912 I 



2TTT" 



probability 
|3.7e-16 



Description 

gp: [GI:g4982462] [LN : AE001824] [AC : AE001824 : AE000512] [PN : conserved hypothetical 
protein] [GN:TM1876] [OR : Thermo toga maritima] [DB :genpept-bct2] [DE : Thermo toga maritima 
section 136 of 136 of the complete genome.] [NT: similar to GB:Pyro_h percent identity: 
60.17;] [LE:4717] [RE:5481] [DI:direct] 
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NTID 



MTO NT AA 

AAID — _ — , score 

Length Length 



probability 



IAI7503000979 14160455 c2 466 
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Description 
NO-HIT 
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score probability 



1AI7503000979 14225327 tl 91 
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Description 

gp: [GI:dl037673:g4126672] [LN: AB016431] [AC : AB016431] [GN: czrB] [FN:Zinc resistance] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus (strain: 912) DNA] 
[DB: genpept-bctl] [DE : Staphylococcus aureus, zinc responsible operon czr genes, 
completeand partial cds . ] [NT:czcD] [LE:724] [RE:1701] [DI : complement] 
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ORF Name 



NT ID 



NT AA 

AAID • . , score 

LengthLength 



probability 



IAI7503000979 1438927 ci 356 
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Description 

Sp: [LN:PYRG_BACSU] [AC:P13242] [GN : CTRA] [OR : BACILLUS SUBTILIS] [EC:6. 3.4.2] [DE : CTP 
SYNTHASE, (OTP- -AMMONIA LIGASE) (CTP SYNTHETASE)] [SP:P13242] [DB : swissprot ] 
>pir : [LN:SYBSTP] [AC : A32354 : S55423 : C69610 ] [PN:CTP 

synthase, : CTP- synthetase : OTP- -ammonia ligase] [GN: ctrA :pyrG ] [CL:CTP synthase] 
[OR:Bacillus subtilis] [EC:6.3.4.2] [DB:pirl] [MP:37 min ] >gp : [GI :gl43597] 
[LN:BACSPO0FA] [AC:M22039] [PN:CTP synthetase] [GN: ctrA] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strains JH642 and UOT0550) DNA] [DB : genpept-bctl] [DE:Bacillus 
subtillis spoOF, CTP synthetase (ctrA) , andf ructose-bisphosphate aldolase (orfY-tsr) 
genes, complete cds.] [LE:339] [RE:1946] [DIrdirect] >gp : [GI : g853762] [LN : BSDNA320D] 
[AC:Z49782] [PN:CTP synthase] [GN:pyrG] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis chromosomal DNA (region 320-321 degrees).] [SP:P13242] [LE:9480] 
[RE:11087] [DI: direct] >gp : [GI :ell862 16 :g2 636252] [LN : BSUB0020] [AC : Z9912 3 : AL009126] 
[PN:CTP synthetase] [GN: ctrA] [FN : pyrimidine biosynthesis] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [EC: 6. 3. 4. 2] [DE:Bacillus subtilis complete genome (section 20 of 
21): from 3798401to 4010550.] [NT : alternate gene name: pyrG] [SP:P13242] [LE.-11346] 
[RE: 12 953] [DI : complement] 



ORF Name 



IA17503000979 14434660 FT 26 



Description 
NO-HIT 
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17503000979 14492142 tl 81 



Description 
NO-HIT 
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Description 

pir: [LN:H71683] [AC:H71683 ] [PN : hypothetical protein RP285] [GN:RP285 ] [OR : Rickettsia 
prowazekii] [DB:pir2] >gp: [GI :el342590:g3 860846] [LN:RPXX02] [AC : AJ2352 71 : AJ235269] 

[PN:unknown] [GN:RP285] [OR : Rickettsia prowazekii] [DB : genpept-bctl] [DE : Rickettsia 
prowazekii strain Madrid E, complete genome; segment2/4.] [LE:68773] [RE:69213] 

[DI : complement] 
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NTID 



AAID 



IAI7503000979 14849093 c3 469 
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LengthLength 
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score probability 
1133 ||6.0e-09 



Description 

pir: [LN:F71245] [AC:F71245 ] [PN : hypothetical protein PHS004] [GN:PHS004 ] 
[OR:Pyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030236 : g3256610] [LN: AP000001] 
[AC:AP000001 :AB009465 : AB009464 : AB009466 :AB009467 :AB009468 :AB009469] [PN:58aa long 

hypothetical protein] [GN:PHS004] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB :genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 1-287000 

nt. position (1/7).] [NT:similar to GENPEPT : Z4754 7 percent identity : 50 . 000] [LE:195255] 
[RE: 195431] [DI : direct] 
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NTID AAID Le - thLe ^th ^^ probability 



IAI7503000979 14Bbb051 cl 401 



] |24 | [3796 |pi— ||15g |p^— || 3.4e-38 



Description 

pir: [LN:H69773] [AC:H69773 ] [PN : conserved hypothetical protein ydcK] [GNiydcK ] 
[CL: hypothetical protein HI1173] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl020070:gl881290] [LN : AB001488] [AC : AB001488] [GNiydcK] [ORrBacillus subtilis] 
[SR:Bacillus subtilis (strain:168) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis genome 
sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] [NT : FUNCTION 
UNKNOWN . ] [LE:61365] [RE:61817] [DIrdirect] >gp : [GI : 61182445 : g2632779] [LN : BSUB0003] 
[AC:Z99106 :AL009126] [GN : ydcK] [FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 3 of 21): from 402751 to611850.] 
[NT:similar to hypothetical proteins] [LE:125110] [RE:125562] [DI:direct] 



ORF Name 



NTID 



IAI7503000979 1S798901 c3 471 
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NO-HIT 
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Description 

sp: [LN:ATP6_BACST] [AC:P42010] [GN: ATPB] [OR:BACILLUS S TE AROTHERMO PH I LUS ] [EC : 3 . 6 . 1 . 34] 
[DE : ATP SYNTHASE A CHAIN, (PROTEIN 6)] [SP:P42010] [DB : swissprot] 

>gp: [GI:dl007828:g534857] [LN : BACATPSAC] [AC:D38059] [PN : ATPase subunit a] [OR:Bacillus 
stearothermophilus] [SR:Bacillus stearothermophilus (strain IFO1035) (library: library 
o] [DB:genpept-bctl] [DE:Bacillus stearothermophilus genes for ATPase subunits a and 
c, complete cds . ] [LE:256] [RE: 966] [DI: direct] 
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NO-HIT 
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Description 

sp: [LN:YWCF_BACSU] [AC:P39604] [GN: YWCF : IPA-42D] [OR:BACILLUS SUBTILIS] 

[DE: HYPOTHETICAL 43.3 KD PROTEIN IN QOXD-VPR INTERGENIC REGION] [SP:P39604] 

[DB: swissprot] >pir: [LN:S39697] [AC : S3 96 9 7 : A70053 ] [PN : cell -division protein homolog 

ywcF:protein ipa-42d] [GN :ywcF ] [OR:Bacillus subtilis] [DB:pir2] >gp: [GI:g413966] 
[LN: BSGENR] [AC:X73124] [GN:ipa-42d] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis genomic region (325 to 333).] [SP:P39604] [LE:44053] [RE:45234] 
[DI:direct] >gp: [GI :ell86311:g2 636347] [LN : BSUB0020] [AC : Z99123 : AL009126] [GN :ywcF] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate gene name: ipa-42d; 

similar to] [SP:P39604] [LE: 112728] [RE: 113909] [DI : complement] 
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Description 

sp: [LN:KITH_BACSU] [AC:Q03221] [GN: TDK] [OR:BACILLUS SUBTILIS] [EC : 2 . 7 . 1 . 21] 
[DE: THYMIDINE KINASE,] [SP:Q03221] [DB : swissprot] >pir : [LN : S55432 ] [AC : S55432 : D69721 ] 
[PN: thymidine kinase, tdk] [GN:tdk ] [CL : thymidine kinase] [OR: Bacillus subtilis] 
[EC:2.7.1.21] [DB:pir2] >gp : [GI : g405819] [LN : BACRHOTDKX] [AC:M97678] [PN : thymidine 
kinase] [GN:tdk] [OR: Bacillus subtilis] [SR: Bacillus subtilis (Transposon Tn917 
insertional library) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis OrfR, 3' end; OrfQ; 
transcriptional terminator (rho) gene; ribosomal protein L31; thymidine kinase (tdk) 
gene, complete cds . ] [NT : Incorrect sequence given in Quirk et al . citation,] [LE:3334] 
[RE:3921] [DIrdirect] >gp : [GI : g853771] [LN : BSDNA32 0D] [AC:Z49782] [PN: thymidine kinase] 
[GNrtdk] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE rB. subtilis chromosomal DNA 
(region 320-321 degrees).] [SP:Q03221] [LE:18786] [RE:19373] [DI:direct] 
>gp: [GI :ell84612 :g2636231] [LN : BSUB0019] [AC : Z99122 : AL009126] [PN : thymidine kinase] 
[GNrtdk] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 7 . 1 . 21] [DE:Bacillus subtilis 
complete genome (section 19 of 21): from 3597091to 3809700.] [SP:Q03221] [LE:204370] 
[RE:204957] [DI : complement ] >gp : [GI :ell86207:g2 636243] [LN : BSUB0020] 
[AC:Z99123 :AL009126] [PN : thymidine kinase] [GN:tdk] [ORrBacillus subtilis] 
[DB: genpept-bctl] [EC : 2 . 7 . 1 . 21] [DE:Bacillus subtilis complete genome (section 20 of 
21): from 3798401to 4010550.] [SP:Q03221] [LE:3060] [RE:3647] [DI : complement] 
>gp: [GI :ell84612 :g2636231] [LN: BSUB0019] [AC : Z99122 : AL009126] [PN : thymidine kinase] 
[GNrtdk] [ORrBacillus subtilis] [DBrgenpept] [EC r 2 . 7 . 1 . 21] [DErBacillus subtilis 
complete genome (section 19 of 21): from 3597091to 3809700.] [SP:Q03221] [LE:204370] 
[RE : 204957] [DI : complement ] 
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|M7503000979_200325a7_t2_127 
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NO-HIT 
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probability 
7.8e-39 
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Description 

spr [LNrYWLG^BACSU] [AC:P39157] [GN : YWLG : IPC- 33D] [OR : BACILLUS SUBTILIS] 
[DEr HYPOTHETICAL 19.4 KD PROTEIN IN SPOIIR-GLYC INTERGENIC REGION] [SPrP39157] 
[DBr swissprot] >pir : [LNr 140482] [AC : 14 0482: D7 0062: S4 9362 ] [PN: hypothetical protein 
ywlG:ipc-33d protein] [GNrywlG : ipc-33d ] [OR:Bacillus subtilis] [DB:pir2] 
>gp r [GIrg556885] [LN r BSSPORUPP] [ACrZ38002] [PNrUnknown] [GN:ipc-33d] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [DE : B . subtilis spoII-R, glyC and upp genes.] [SPrP39157] 
[LE:4750] [REr5292] [DIrdirect] >gp r [GI r ell 84 597 rg2 636216] [LN : BSUB0019] 
[AC:Z99122 :AL009126] [GNrywlG] [FNrunknown] [ORrBacillus subtilis] [DB r genpept-bctl] 
[DErBacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT: alternate gene name: ipc-33d] [SP:P39157] [LE: 192609] [RE: 193151] [DI : complement] 
>gp: [GI:ell84597:g2636216] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN:ywlG] [FN:unknown] 
[OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis complete genome (section 19 
of 21): from 3597091to 3809700.] [NT : alternate gene name: ipc-33d] [SP:P39157] 
[LE : 192609] [RE : 193151] [DI : complement] 



92 



ORF Name 



NT AA 
NTID AAID — - T — , , score 
LengthLength 



probability 



[AI 7 503000979 203484b3 c3 518 
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Description 

gp: [GI :g4409804] [LN : AF091502] [AC : AF091502] [PN : autoaggregat ion-mediating protein] 
[GN:aggH] [OR : Lactobacillus reuteri] [DB : genpept -bct2 ] [DE : Lactobacillus reuteri 
autoaggregation-mediating protein (aggH)gene, complete cds . ] [NT : AggH; putative 
extracellular DEAD-box RNA helicase] [LE:181] [RE: 1674] [DI: direct] 
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Description 
NO-HIT 
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Description 

pir: [LN:G70041] [AC:G70041 ] [PN : conserved hypothetical protein yvgZ] [GNryvgZ ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell86040 : g2635865] [LN: BSUB0018] 
[AC:Z99121 :AL009126] [GN:yvgZ] [FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 18 of 21): from 3399551to 3609060.] 
[NT: similar to hypothetical proteins] [LE: 43408] [RE: 43713] [DI : complement] 
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Description 

gp: [GI:gl086838] [LN : CELF10E7] [AC:U41264] [GN:F10E7.2] [OR : Caenorhabditis elegans] 
[SR:Caenorhabditis elegans strain=Bristol N2] [DB : genpept - invl] [DE : Caenorhabditis 
elegans cosmid F10E7.] [NT:also contains weak similarity to a C2H2-type zinc] 
[LE : 21609: 21709: 23024 : 232 77] [RE : 21661 : 21796 : 23149 : 23 753 ] [DI : direct Join] 



ORF Name 



NT AA 

NTID AAID — ~ , , score 

Length Length 



probability 



IM7503000979 2117125 c2 432 
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Description 
NO-HIT 
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NO-HIT 
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Description 

gp: [GI:dl037674:g4126673] [LN: AB016431] [AC : AB016431] [GN:CzrA] [FN : repressor] 
[OR: Staphylococcus aureus] [SR : Staphylococcus aureus (strain: 912) DNA] 
[DB :genpept-bctl] [DE : Staphylococcus aureus, zinc responsible operon czr genes, 
completeand partial cds.] [LE:1703] [RE:2023] [DI : complement] >gp : [GI : g3445566] 
[LN:AF044951] [AC : AF044 951] [PN : repressor protein] [GN : rzcA] [FN: zinc and cobalt 
transport repressor] [OR : Staphylococcus aureus] [DB : genpept-bct2] [DE : Staphylococcus 
aureus repressor protein (rzcA) and transportprotein (rzcB) genes, complete cds.] 
[LE:193] [RE:513] [DIrdirect] 
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Description 
NO-HIT 
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Description 

sp: [LN:THIE_BACSU] [AC:P39594] [GN : THIE : THIC : IPA-26D] [OR:BACILLUS SUBTILIS] 
[EC:2.5.1.3] [DE : PYROPHOSPHORYLASE) (TMP-PPASE) (THIAMIN- PHOSPHATE SYNTHASE)] 
[SP:P39594] [DB : swissprot] >pir : [LN : S3 9681] [AC : S3 9681 : E6 9722 ] [PN : thiamin-phosphate 
pyrophosphorylase, thiC: protein ipa-26d] [GN:thiC ] [CL:thiE protein : thiamin-phosphate 
pyrophosphorylase homology] [OR: Bacillus subtilis] [EC: 2. 5. 1.3] [DB:pir2] 
>gp: [GI:g413950] [LN: BSGENR] [AC:X73124] [GN:ipa-26d] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] [SP:P39594] [LE:26188] 
[RE:26856] [DI: direct] >gp: [GI : ell86328 : g2636364] [LN : BSUB0020] [AC : Z99123 : AL009126] 
[PN: thiamine-phosphate pyrophosphorylase] [GN:thiC] [FN : substitution of the 
pyrophosphate of] [OR:Bacillus subtilis] [DB :genpept-bctl] [EC:2.5.1.3] [DE:Bacillus 
subtilis complete genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate 
gene name: ipa-26d, ywbK] [SP:P39594] [LE: 131103] [RE: 131771] [DI : complement] 
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Description 

Sp: [LN: YPGQ_BACSU] [AC:P54168] [GN: YPGQ] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 23.1 
KD PROTEIN IN BSAA-ILVD INTERGENIC REGION] [SP:P54168] [DB : swissprot] >pir: [LN:E69935] 
[AC:E69935 ] [PN : conserved hypothetical protein ypgQ] [GN:ypgQ ] [CL : conserved 
hypothetical protein AF0994] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : gl256633] 
[LN : BACYACA] [AC:L77246] [GN : ypgQ] [OR:Bacillus subtilis] [DB : genpept-bct 1] 
[DE: Bacillus subtilis (YAC10-9 clone) DNA region between the serA andkdg loci.] 
[NT:putative] [LE:16914] [RE:17531] [DI:direct] >gp: [GI : el 183636 :g2 634609] 
[LN:BSUB0012] [AC : Z99115 : AL009126] [GN:ypgQ] [FN:unknown] [OR:Bacillus subtilis] 
[DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 12 of 21): from 
2195541to 2409220.] [NT:similar to hypothetical proteins] [SP:P54168] [LE:107621] 
[RE: 108238] [DI complement] 
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ORF Name 



NT AA 
NT ID AAID — - _ — , . score 
LengthLength 



probability 



H7503000979 22460882 c3 452 



] | 44 1 I 3816 IP^P** ||gT2 — ||1.0e-59 



Description 

gp: [GI:ell91863:g809660] [LN:BSDNPOP] [AC:X82174] [PN : deoxyribose -phosphate aldolase] 
[GN:dra] [OR: Bacillus subtilis] [DB : genpept-bctl] [EC: 4. 1.2. 4] [DE : B . subtilis operon 
contg. dra, nupC and pdp genes.] [SP:P39121] [LE:1462] [RE: 2106] [DI:direct] 



ORF Name 



NT AA 
NT ID AAID — _ — , . score 
Length Length 



probability 



^17503000979 22692137 C2 464 



] I 45 I I 3617 l|^ni lgl |P 8 ||9.2e-68 



Description 

gp: [GI :e279934 :gl934990] [LN : SASIGFACB] [AC:Y07645] [GN:rsbW] [OR : Staphylococcus 
aureus] [DB : genpept-bctl] [DE:S. aureus sigB gene.] [LE:2220] [RE:2699] [DI: direct] 
>gp: [GI:e284999:gl729796] [LN:SAUSIGB] [AC:Y09929] [GN:rsbW] [OR : Staphylococcus aureus] 
[DB: genpept-bctl] [DE : S . aureus rsbU, rsbV, rsbW & sigB genes.] [LE:2233] [RE:2712] 
[DI : direct] 



ORF Name 



NT AA 

NTID AAID — ^ , — ^, score probability 
LengthLength 



E 



I7S03000979 23437803 c3 S23 



J |4b | [3818 ||2187]|728 | [1827 ||1 . 9e -TOT" 



Description 

gp: [GI:e279936:gl934992] [LN : SASIGFACB] [AC:Y07645] [GN:ORF6] [OR : Staphylococcus 
aureus] [DB : genpept-bctl] [DE:S. aureus sigB gene.] [LE:3878] [RE:>5272] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ — , score 
LengthLength 



probability 



^I7S03000979 23439002 c2 417 



]E 



[3819 ||T5UTj p^T 



Description 
NO-HIT 

ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IAI7503000979 2344S887 £3 523 



] | 48 1 p820 |pT3T]|384 | p3~ ||6 . 3e-53 



Description 

gp: [GI:g4097757] [LN : SAU67965] [AC:U67965] [PN: lytic regulatory protein] 
[OR staphylococcus aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus lytic regulatory 
protein gene, complete cds . ] [LE:712] [RE: 1824] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — — score 
LengtnLengtn 



probability 



IAI7503000979 23S940S7 cl 3S2 



-| |49 | [3821 ||183~|p | 



Description 
NO-HIT 
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ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



03000979 23b9bl37 c3 480 



] |50 | [3822 |pi~ ||166 |pT~]|l.le-39 



Description 

pir: [LN:H69618] [AC:H69618 ] [PN:stress- and starvation- induced gene controlled by 
sigma-B dps ] [GN:dps ] [CL : hypothetical protein HI1349] [OR:Bacillus subtilis] 
[DB:pir2] >gp : [GI : ell85938 : g2635549] [LN: BSUB0016] [AC : Z99119 : AL009126] [GNrdps] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
16 of 21): from 2997771to 3213410.] [NT : alternate gene name: ytkB; stress- and] 
[SP:P80879] [LE:137548] [RE:137985] [DI : complement] >gp : [GI : g22 93159] [LN: AF008220] 
[AC:AF008220] [GN:ytkB] [OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis 
rrnB-dnaB genomic region.] [NT : similarity to HI1349 from H. influenzae] [LE:42442] 
[RE:42879] [DI :direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IAI7503000979 2362b008 cl 347 



][ 



FT 



[3823 |p3~||320 |[7W"||3.2e-74 



Description 

gp: [GI:dl039089:g4512388] [LN: AB011838] [AC : AB011838] [PN :mannnose-6 phospate 
isomelase] [GN:ydhS] [OR:Bacillus halodurans] [SR:Bacillus halodurans (strain : C- 125) 
DNA, clone_lib: lambda no.] [DB :genpept-bctl] [EC:5.3.1.8] [DE:Bacillus halodurans C-125 
genomic DNA, 6A fragment, clone ALBAC004 . ] [NT:similar to B. subtilis ydhS gene(53-69% 
identity)] [LE:7672] [RE: 8619] [DI: direct] 



ORF Name 



|AI7503000979_23626367_cl_402 

Description 

NO-HIT 



XTm ^ NT AA 

NTID AAID — — ^ score 

LengthLength 



probability 



[3824 |p^~| 



ORF Name 



IA17503000979 23634678 c2 424 



Description 
NO-HIT 



ORF Name 



E 



l!7!>03000979 23634702 c2 453 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



] [S3 | [3825 ||li94||397 | 



NTID 



AAID 



NT AA 
LengthLength 
797 



score probability 
1.3e-47 



Description 

sp: [LN:THIM_BACSU] [AC:P39593] [GN:THIM: THIK: IPA-25D] [OR:BACILLUS SUBTILIS] 
[EC:2.7.1.50] [DE : HYDROXYETHYLTHIAZOLE KINASE) (THZ KINASE) (TH KINASE)] [SP:P39593] 
[DB:swissprot] >pir : [LN: S39680] [AC : S39680 : G69722 ] [PN : hydroxyethylthiazole kinase, 
thiK:protein ipa-25d] [GN:thiK ] [CL : hydroxyethylthiazole kinase: hydroxyethylthiazole 
kinase homology] [OR:Bacillus subtilis] [EC : 2 . 7 . 1 . 50] [DB:pir2] >gp : [GI :g413949] 
[LN : BSGENR] [AC:X73124] [GN:ipa-25d] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis genomic region (325 to 333).] [SP:P39593] [LE:25373] [RE:26191] 
[DI:direct] >gp: [GI : el 186329 :g2 636365] [LN: BSUB0020] [AC : Z99123 : AL009126] 
[PN: hydroxyethylthiazole kinase] [GN:thiK] [FN phosphorylation of] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [EC : 2 . 7 . 1 . 50] [DE:Bacillus subtilis complete genome 
(section 20 of 21): from 3798401to 4010550.] [NT : alternate gene name: ipa-25d, ywbJ] 
[SP:P39593] [LE:131768] [RE:132586] [DI : complement] 
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ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000979 23651702 ci 343 



J |55 | [3927 ||870 1 1289 |p^~ 



probability 
V . le-22 



Description 

pir: [LN:C70070] [AC:C70070 ] [PN : conserved hypothetical protein ywtE] [GNrywtE ] 
[CL: hypothetical protein ywpj] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI :ell84491:g2636110] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN : ywtE] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [NTisimilar to hypothetical proteins] [LE:97330] 
[RE:98190] [DI:direct] >gp : [GI : e308093 :gl894770] [LN : BSZ92954] [AC:Z92954] [GNrywtE] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis yws [A, B, C, D, E , F, G] and gerBC 
genes.] [NT:product similar to Bacillus subtilis YxeH and YcsE] [LE:4292] [RE: 5152] 
[DI: complement] >gp: [GI : ell84491 :g2636110] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN:ywtE] 
[FN:unknown] [OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT:similar to hypothetical proteins] 
[LE:97330] [RE:98190] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — , — _ score 
LengthLength 



probability 



17503000979 23711S42 ci 397 



][ 



[3828 ||i0Z5~|[341 1 1±445 | 



5.Se-148 



Description 

gp: [GI:e284997:gl729794] [LN:SAUSIGB] [AC:Y09929] [GN:rsbU] [OR : Staphylococcus aureus] 
[DB: genpept-bctl] [DE:S. aureus rsbU, rsbV, rsbW & sigB genes.] [LE:784] [RE: 1785] 
[DI:direct] 



ORF Name 



„ T „ NT AA 
AAID — — _ score 
LengthLength 



IAI7503000979 2379658 £2 237 



Description 
NO-HIT 



NTID 

] | 57 | |3829 ||225 1 174 | 



probability 



ORF Name 



_ Tr _ _ NT AA 

NTID AAID — _ T — . , score 

LengthLength 



probability 



IAT7503000979 2383253 c2 405 
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Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



IAI7503000979 24015687 c2 438 



] |59 | |383i |pB~[| 154 | pT^~[ |l . 8e-0 T" 



Description 

sp: [LN:ATPZ_BACP3] [AC:P09354] [OR:BACILLUS PS3] [SR :, THERMOPHILIC BACTERIUM PS-3] 
[DE : ATP SYNTHASE PROTEIN I] [SP:P09354] [DB : swissprot] >pir : [LN : S01397] [AC:S01397 ] 
[PN:H+-transporting ATP synthase, chain I] [CL:Bacillus H+- transporting ATP synthase 
chain I] [OR : thermophilic bacterium PS-3] [EC : 3 . 6 . 1 . 34] [DB:pir2] >gp : [GI :g45809] 
[LN:PS3TF0F1] [AC : X07804 : X073 74] [OR : thermophilic bacterium PS3] [DB : genpept-bctl] 
[DE thermophilic bacterium PS3 TF0F-1 operon for ATP synthase complex.] [NT: I protein 
(AA 1 - 127)] [SP:P09354] [LE:433] [RE:816] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID . , _ — v 
LengthLength 



score 



IAI7503000979 24074137 c3 492 



|50 | [3832 |p?T]|443 | p5TT| [ 



probability 
5.7e-139 



Description 

gp: [GI:gl43434] [LN: BACRHOTDKX] [AC:M97678] [PN:Rho Factor] [GN:rho] 
[FN transcriptional terminator] [OR: Bacillus subtilis] [SR: Bacillus subtilis 
(Transposon Tn917 insertional library) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis 

OrfR, 3' end; OrfQ; transcriptional terminator (rho) gene; ribosomal protein L31; 

thymidine kinase (tdk) gene , complete cds . 3 [LE:1643] [RE:2926] [DI:direct] 
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ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



AI7503000979 24228411 Cl 375 



] |6i I [ 3833 | p9~||132 | |[ 8 . 6e-33 



Description 

gp: [GI :g2735512] [LN : SCU96108] [AC:U96108] [PN : single- strand binding protein homolog] 
[GN:ssb] [OR: Staphylococcus carnosus] [DB :genpept-bct2] [DE : Staphylococcus carnosus 
(3R) -hydroxymyristoyl acyl carrier proteindehydrase homolog (fabZ) gene, partial cds, 

YwpF homolog, single-strand binding protein homolog (ssb) , SceD precursor (sceD),SceA 

precursor (sceA) and SceE precursor (sceE) genes, completecds, and TenA homolog (tenA) 

gene, partial cds . ] [NT:SSB] [LE:857] [RE:1294] [DIrdirect] 



ORF Name 



NTID 



7v7vTr ^ NT AA 

AAID — — score 

LengthLength 



probability 



IAI7503000979 24235552 ci 338 



][ 
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Description 

gp: [GI:e284998:gl729795] [LN:SAUSIGB] [AC:Y09929] [GN:rsbV] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE:S. aureus rsbU, rsbV, rsbW & sigB genes.] [LE:1905] [RE:2231] 
[DI : direct] 



ORF Name 



NTID 



NT 

AAID _ 



AA 



LengthLength 



score 



IAI7503000979 24245327 cl 354 
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probability 
3.5e-84 



Description 

gp: [GI:e258331:gl765902] [LN : BCUPPGLYA] [AC:X99545] [PN:uracil 

phosphor ibosyl trans f erase] [GN:upp] [OR:Bacillus caldolyticus] [DB : genpep t -bet 1 ] 
[DE:B.caldolyticus upp gene.] [SP:P70881] [LE:431] [RE: 1060] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



.. — „ — . score probability 
LengthLength c *- 



1A17503000979 24254202 cl 368 



] |b4 | [3836 | [1517] |5 03 | puTTT||4 . 3e-212 



Description 

sp : [ LN : ATPA_BACME ] [AC:P17674] [GN : ATPA] [OR : BACILLUS MEGATER I UM] [EC : 3 . 6 . 1 . 34 ] [DE : ATP 
SYNTHASE ALPHA CHAIN,] [SP:P17674] [DB : swissprot] >pir : [LN : F31482] [AC:F31482 ] 
[PN:H+-transporting ATP synthase, alpha chain] [CL : H+- transporting ATP synthase alpha 
chain: H+- transporting ATP synthase alpha chain homology] [OR: Bacillus megaterium] 
[EC:3.6.1.34] [DB:pir2] >gp : [GI:gl42559] [LN : BACATPA] [AC : M202 55 : J04455 :M18352 : M23 924 ] 
[PN:ATP synthase alpha subunit] [OR:Bacillus megaterium] [SR : B .megaterium (QM B1551) 
DNA, clones pWSBlOO, pCAH1.3, and pWPC208] [DB .-genpep t- bet 1] [DE : B . megaterium ATP 
synthase i , a, c, b, delta, alpha, gamma, beta andepsilon subunit genes, complete cds, and 
ORF.] [LE:2853] [RE:4361] [DI: direct] 



ORF Name 



117503000979 242565^2 cl 381 



NTID 



AAID — 



NT AA 
LengthLength 
TIE 



score probability 



Ffl 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000979 24350953 tl 2 
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probability 
| |0.017 



Description 

pir: [LN:G71244] [AC:G71244 ] [PN : hypothetical protein PH0217] [GN:PH0217 ] 
[OR : Pyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030229 : g3256603] [LN : AP000001] 
[AC:AP000001 :AB009465 :AB009464 :AB00 9466 :AB009467 :AB00946 8 :AB009469] [PN:106aa long 

hypothetical protein] [GN:PH0217] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB : genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 1-287000 

nt. position (1/7).] [LE:191072] [RE:191392] [DI : complement ] 



98 



NT AA 

QRF Name MTID AAID Le — thLe — th score probability 



M7B03000979_243i>34a7_c3J>03 | [ 67 | [ 3839 | pi~" | | 180 | pf3~|| 4 . Oe-19 



Description 

sp: [LN:ATPD_BACSU] [AC:P37811] [GNrATPH] [OR : BACILLUS SUBTILIS] [EC : 3 . 6 . 1 . 34] [DE : ATP 
SYNTHASE DELTA CHAIN, ] [SP:P37811] [DB : swissprot] >pir : [LN : 140365] 
[AC:I40365:D69592:S39253 ] [PN :H+- transporting ATP synthase, delta chain (atpH) ] 
[GNratpH ] [CL r H+- transporting ATP synthase delta chain] [OR:Bacillus subtilis] 
[EC:3.6.1.34] [DB:pir2] >gp : [GI : g433988] [LN : BSATPASE] [AC:Z28592] [PN: ATP synthase 
subunit delta] [GN : atpH] [ORrBacillus subtilis] [DB : genpep t - bet 1 ] [DE : B . subtilis (168) 
atpase genes for ATP synthase subunits i, a, c ,b, delta, alpha, gamma, beta, epsilon.] 
[SP:P37811] [LE:2484] [RE:3029] [DIrdirect] >gp : [GI : ell84590 :g2636209] [LN : BSUB0019] 
[AC:Z99122:AL009126] [PN:ATP synthase (subunit delta)] [GNratpH] [ORrBacillus subtilis] 
[DBrgenpept-bctl] [EC r 3 . 6 . 1 . 34] [DErBacillus subtilis complete genome (section 19 of 
21) r from 3597091to 3809700.] [SPrP37811] [LErl87368] [RErl87913] [DI r complement] 
>gpr [GIrell84590rg2636209] [LN r BSUB0019] [AC r Z99122 r AL009126] [PN : ATP synthase (subunit 
delta)] [GNratpH] [ORrBacillus subtilis] [DBrgenpept] [EC r 3 . 6 . 1 . 34] [DErBacillus 
subtilis complete genome (section 19 of 21) r from 3597091to 3809700.] [SP:P37811] 
[LE r 187368] [RE: 187913] [DI r complement] 



ORF Name 


NTID 


NT AA 
AAID — — 

LengthLength 


score 


probability 


AI7503000979_24392i93_cl_346 
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Description 
NO-HIT 










ORF Name 


NTID 


„ TO NT AA 

AAID 

LengthLength 


score 


probability 


|M7S03000979_244062S0_c2_433 


ir 


| | 38 41 |p»6||4il | 


|I258 | 


|3.7e-128 | 



Description 

Sp r [LN r MURA_BACSU] [AC : P19670 r Q03225] [GN r MURA r MURZ] [ OR r B AC I LLUS SUBTILIS] 
[EC r 2. 5. 1.7] [DE r ENOLPYRUVYL TRANSFERASE) (EPT) ] [SP r P196 70 r Q03225] [DB r swissprot ] 
>pirr [LNrG32354] [AC r S55428 : G32354 r H69662 ] [PN : UDP-N-acetylglucosamine 
1-carboxyvinyltransf erase, murZ] [GNrmurZ ] [CL r UDP-N-acetylglucosamine 
1-carboxyvinyl transferase MurZ] [ORrBacillus subtilis] [ECr 2.5.1.7] [DBrpir2] 
>gpr [GIrg853767] [LN : BSDNA320D] [AC:Z49782] [PN : UDP-N-acetylglucosamine] [GNrmurZ] 
[ORrBacillus subtilis] [DBrgenpept-bctl] [DE r B . subtilis chromosomal DNA (region 320-321 
degrees).] [SPrP19670] [LErl4403] [RErl5692] [DIrdirect] >gp : [GI : ell84616 : g2636235] 
[LN:BSUB0019] [ACrZ99122 rAL009126] [PN :UDP-N-acetylglucosamine] [GNrmurZ] 
[FNrpeptidoglycan biosynthesis] [ORrBacillus subtilis] [DBrgenpept-bctl] [ECr2.5.1.7] 
[DErBacillus subtilis complete genome (section 19 of 21) : from 3597091to 3809700.] 
[NT r alternate gene name r murZ, IssF, rev-4] [LEr 208051] [RE r 209340] [DI : complement] 
>gpr [GIrell86211:g2636247] [LN : BSUB0020] [AC : Z99123 r AL009126] 

[PNr UDP-N-acetylglucosamine] [GNrmurZ] [FNrpeptidoglycan biosynthesis] [ORrBacillus 
subtilis] [DBrgenpept-bctl] [ECr 2. 5. 1.7] [DErBacillus subtilis complete genome (section 
20 of 21) r from 3798401to 4010550.] [NT r alternate gene name : murZ, IssF, rev-4] 
[LEr6741] [REr8030] [DI r complement ] >gp r [GI rell84616 rg2 636235] [LN rBSUB0019] 
[ACrZ99122 rAL009126] [PN rUDP-N-acetylglucosamine] [GNrmurZ] [FNrpeptidoglycan 
biosynthesis] [ORrBacillus subtilis] [DBrgenpept] [ECr 2. 5. 1.7] [DErBacillus subtilis 
complete genome (section 19 of 21) : from 3597091to 3809700.] [NT r alternate gene name: 
murZ, IssF, rev-4] [SPrP19670] [LE:208051] [RE:209340] [DI : complement] 



NT AA 

ORF Name NTID AAID — . — , _ score probabil ity 
LengthLength c -L 

|AI7503000979_2440763:L_t 3^252 | [70 | [ 3842 [ 47 | 

Description 
NO-HIT 



99 



NT AA 

ORFName NTID AAXD Le ~ thLe ~ th probability 



AI75030009VajM41i>93J_c2_41fl | | 7i | [ 3843 — j | 144 |p || 0.025 



Description 

sp: [LN:YUXK_BACSU] [AC : P40761 : 005233] [GN: YUXK] [0R:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
15.7 KD PROTEIN IN PBPD-COMA INTERGENIC REGION (ORF2)] [SP : P40761 : 005233] 
[DBrswissprot] >pir : [LN: B55220] [AC : B55220 :D70025 ] [PN : hypothetical protein yuxKrpbpD 
3' -region hypothetical protein] [GN:yuxK ] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:g520537] [LN: BSU11882] [AC:U11882] [PN:unknown] [GN :pbpD] [0R:Bacillus 
subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis orfl, partial cds, penicillin-binding 
protein 4 (pbpD) gene, complete cds, and orf2, complete cds.] [NT:orf2] [LE:2760] 
[RE:3173] [DI:direct] >gp: [GI : ell84228 :g2635646] [LN : BSUB0017] [AC : Z99120 : AL009126] 
[GN : yuxK] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 17 of 21): from 3197001to 3414420.] [NT : alternate gene name: 
yugD] [SP:P40761] [LE:37885] [RE:38298] [DI: direct] >gp : [GI : e311522 :gl934785] 
[LN:BSZ93 933] [AC:Z93 933] [PN: unknown] [GN:yugD] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic DNA fragment from yugA to yugD.] [SP:P40761] 
[LE:7318] [RE:7731] [DI: direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|&I7S03000979_2460flB63_c2_436 [ |72 | [ 3844 | pu^7] |353 | |122g pTue -m 

Description 

sp: [LN:RF1_BACSU] [AC:P45872] [GN: PRFA] [OR: BACILLUS SUBTILIS] [DE : PEPTIDE CHAIN 
RELEASE FACTOR 1 (RF-1) ] [SP:P45872] [DB : swissprot] >pir : [LN : S5543 7] [AC : S55437 : G69681 
] [PN: translation releasing factor RF-l:peptide chain release factor 1] [GN:prf A: RF-1 ] 
[CL: translation releasing factor] [OR:Bacillus subtilis] [DB:pirl] >gp: [GI:g853776] 
[LN: BSDNA32 0D] [AC:Z49782] [PN:peptide chain release factor 1] [GN :prf A] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE : B . subtilis chromosomal DNA (region 320-321 degrees).] 
[NT:gtg start codon] [SP:P45872] [LE:23623] [RE:24693] [DI: direct] 

>gp: [GI:ell84607:g2636226] [LN: BSUB0019] [AC : Z99122 : AL009126] [PN:peptide chain release 
factor 1] [GN :prf A] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 19 of 21) : from 3597091to 3809700.] [SP:P45872] [LE: 199050] 
[RE:200120] [DI : complement ] >gp: [GI :ell84607:g2 636226] [LN:BSUB0019] 
[AC:Z99122 :AL009126] [PN:peptide chain release factor 1] [GN :prf A] [OR:Bacillus 
subtilis] [DB:genpept] [DE:Bacillus subtilis complete genome (section 19 of 21) : from 
3597091to 3809700.] [SP:P45872] [LE:199050] [RE:200120] [D I : complement] 

NT AA 

ORF Name NTID AAID — _ — , , score probabilit y 
LengthLength c 



A17B03000979_2462S216_t3_264 | | 73 | [ 3845 |p5~| | 61 | 



Description 
NO-HIT 



100 



ORF Name 



NTID 



NT 

AAID __ 



AA 



LenqthLength 



score 



IAI7503000979 2464U87 c2 437 



| 7 4 | [3846 |p^— [|141 \^TJ— | [ 



probability 
5 . 7e-20 



Description 

sp: [LN:YWLE_BACSU] [AC:P39155] [GN: YWLE : IPC- 3 ID] [OR:BACILLUS SUBTILIS] [EC : 3 . 1 . 3 . 48 ] 
[DE:(EC 3.1.3.48)] [SP:P39155] [DB : swissprot] >pir : [LN : S49360] [AC : 140479 :B70062 : S49360 

] [PN:protein-tyrosine-phosphatase, homolog ywlE, low molecular weight : ipc-31d protein] 
[GN : ywlE : ipc - 3 Id ] [CL : protein- tyrosine -phosphatase , low molecular weight] [ORrBacillus 

subtilis] [EC:3 .1.3.48] [DBrpirl] >gp : [GI :g556883] [LN : BSSPORUPP] [AC:Z38002] 
[PNrUnknown] [GN:ipc-31d] [ORrBacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis 

spoII-R, glyC and upp genes.] [SP:P39155] [LE:3679] [RE:4131] [DI:direct] 

>gp: [GI:ell84599:g2636218] [LN: BSUB0019] [AC : Z99122 : AL009126] [GNrywlE] [FNrunknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete genome (section 

19 of 21): from 3597091to 3809700.] [NT : alternate gene name: ipc-31d; similar to] 
[SP:P39155] [LE:193770] [RE:194222] [DI : complement] >gp : [GI :ell84599:g2 636218] 
[LN:BSUB0019] [AC : Z99122 : AL009126] [GN : ywlE] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept] [DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 

3809700.] [NTralternate gene name: ipc-31d; similar to] [SP:P39155] [LE: 193770] 
[RE: 194222] [DI : complement ] 



ORF Name 



NTID 



AAID 



NT 



AA 



— , _ — , score probability 
LengthLength <c J - 



^7503000979 24S42963 c2 458 



] |75 | [3847 |pT^~||71 |[5T ||0.00i9 



Description 

sp : [LN : COPP_HELFE] [AC:O32620] [GN:COPP] [OR : HELICOBACTER FELIS] [DE: COP ASSOCIATED 
PROTEIN (COPPER ION BINDING PROTEIN)] [SP:O32620] [DB : swissprot] 

>gp: [GI:e353967:g2660543] [LN:HFAJ1932] [AC : AJ001932] [GN:copP] [FN:divalent cation 
binding protein] [OR Helicobacter felis] [DB : genpept-bctl] [DE : Helicobacter felis ftsH, 
copA, copP genes and two ORF's.] [SP:O32620] [LE:5306] [RE:5506] [DI:direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



17503000979 24647558 cl 352 



] |76 | [3848 |p^7~ ||118 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



IAI7503000979 24565932 c3 496 



][ 



77" 



[3849 ||gTu^ | prg~ 



score probability 
| |477 |pTTe-45 



Description 

Sp : [LN : HEMK_BACSU] [AC:P45873] [GN: YWKE] [OR:BACILLUS SUBTILIS] [DE:HEMK PROTEIN 
HOMOLOG] [SP:P45873] [DB : swissprot] >pir : [LN : S55438] [AC : S55438 : D70061 ] 
[PN:protoporphyrinogen oxidase homolog ywkE] [GN : ywkE ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI:g853777] [LN : BSDNA320D] [AC:Z49782] [GN:ywkE] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE :B . subtilis chromosomal DNA (region 320-321 degrees).] [NT:product 
similar to E.coli PRFA2 protein] [SP:P45873] [LE:24695] [RE:25561] [DI:direct] 
>gp: [GI : el 1846 06 :g2 636225] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN: ywkE] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [NT:similar to protoporphyrinogen oxidase] 
[SP:P45873] [LE:198182] [RE:199048] [D I : complement] >gp : [GI : ell 84 606 :g2 636225] 
[LN:BSUB0019] [AC : Z99122 : AL009126] [GN:ywkE] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept] [DE:Bacillus subtilis complete genome (section 19 of 21) : from 3597091to 
3809700.] [NT: similar to protoporphyrinogen oxidase] [SP:P45873] [LE: 198182] 
[RE: 199048] [DI : complement] 
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ORF Name 



[73 



NT AA 
Length Length 

i p850 |psi | pre 



NTID AAID r .— ^ scor e probability 

I6.3e-I4 



|AI 7 503000979_2b41301_c3_b0b 

Description 

pir: [LN:G70070] [AC:G70070 ] [PN : hypothetical protein ywzB] [GN : ywzB ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI : ell84584 :g2636203] [LN: BSUB0019] [AC : Z99122 : AL009126] 
[GNrywzB] [FNrunknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 19 of 21): from 3597091to 3809700.] [LE:182127] [RE:182357] 
[DI: complement] >gp: [GI :ell84584 :g2 636203] [LN: BSUB0019] [AC : Z99122 : AL009126] [GN : ywzB] 
[FNrunknown] [OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [LErl82127] [RErl82357] [DI : complement] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



^17503000979 25422081 c3 475 



[3851 ipr^ip "|pr 



probability 
u~uT3 



Description 

sp: [LN:ETF1_F0WP1] [ACrP21966] [GNr FPD6] [OR r FOWL POX VIRUS] [SRrFP-lJ [DErEARLY 
TRANSCRIPTION FACTOR 70 KD SUBUNIT] [SPrP21966] [DB r swissprot] >pir : [LN r F35216] 
[ACrF3 5216 ] [PNr FPD6 protein] [CLr vaccinia virus early transcription factor 70K chain] 
[OR rfowlpox virus] [DB:pir2] >gp : [GI :g61229] [LN: POFPHIND] [AC:X17202] [GN:ORF FPD6] 
[FN: Vaccinia D6 homolog] [OR:Fowlpox virus] [DB : genpept - vrl ] [DE:Fowlpox virus DNA 
sequence (Hind III fragment).] [SP:P21966] [LE:6614] [RE:8431] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



LA17503000979 26173800 ci 387 



Description 
NO-HIT 



] |80 | [3852 |[T51g [ |5i | 



ORF Name 



^17503000979 25182767 c2 425 



NTID AAID 



FT" 



3TT5T" 



NT AA 
LengthLength 
TZ1 



score probability 



Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID — — , , scor e 
LengthLength 



probability 



IA17503000979 25212756 t2 137 



] F I P 854 ||i^58"||455 |pT7T]|2.3e-i51 



Description 

sp: [LN:YWNE_BACSU] [AC:P71040] [GN: YWNE] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 55.8 

KD PROTEIN IN SPOIIQ-MTA INTERGENIC REGION] [SP:P71040] [DB : swissprot] >pir : [LN : G70063 ] 
[AC:G70063 ] [PN : cardiolipin synthase homolog ywnE] [GN : ywnE ] [CL: Bacillus probable 

cardiolipin synthetase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84565 :g2636184] 
[LN:BSUB0019] [AC : Z99122 : AL009126] [GN: ywnE] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept -bet 1] [DE:Bacillus subtilis complete genome (section 19 of 21): from 

3597091to 3809700.] [NT:similar to cardiolipin synthase] [SP:P71040] [LE:164628] 
[RE:166076] [DIrdirect] >gp: [GI : e269549 : gl592 701] [LN:BSUEROP] [AC:Y08559] [PN:Unknown] 
[GN : ywnE] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis urease operon and 

downstream DNA.] [NT: Product similar to Escherichia coli cardiolipin] [SP:P71040] 
[LE:5155] [RE:6603] [DI : complement ] >gp: [GI:ell84565:g2 636184] [LN : BSUB0019] 
[AC:Z99122 :AL009126] [GN : ywnE] [FN:unknown] [OR:Bacillus subtilis] [DB:genpept] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT:similar to cardiolipin synthase] [SP:P71040] [LE:164628] [RE:166076] [DI:direct] 
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NT AA 

ORFName NTID AAI5 Le ^thLe^th ^^ probability 



AI7503000979__26360250_c3_b20 | | B3 | [ 3855 ||375 | | 124 | ■ le-i9 

Description 

gp: [GI:el340260:g3850850] [LN : SAU16431] [AC:Y16431] [PN:dpj protein] [GNrdpj] 
[OR: Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus dp j , air genes, 
partial kdpC gene and 40RF»s.] [LE:2975] [RE: 3334] [DI : direct] 



ORF Name 



NTID 



NT 



AA 



AAID , 

LengthLength 



score 



IAI7503000979 25751542 ii 73 



] I 84 1 | 3S5g IF^~ll lg3 IP 7 " 



probability 
Oe^ 



Description 

pir : [LN:A64533] [AC:A64533 ] [PN : conserved hypothetical protein HP0105] [CL : conserved 
hypothetical protein HI0491] [OR : Helicobacter pylori] [DB:pir2] >gp : [GI :g2313188] 
[LN:AE000532] [AC : AE000532 : AE000511] [PN : conserved hypothetical protein] [GN:HP0105] 
[OR Helicobacter pylori 26695] [DB :genpept-bct2] [DE : Helicobacter pylori 26695 section 
10 of 134 of the complete genome.] [NT:similar to GB:L42023 SP:P44007 PID:1003866] 
[LE : 5494] [RE : 5961] [DI : complement] 



ORF Name 



UVb03000979 25757577 c2 408 



NTID 



1ST 



NT 



AA 



^ Le^thLemth ^ probability 



J [3857 ||1377 1 



2153 ||4.6e-224 



Description 

gp: [GI:el352473 :g3892895] [LN : SAARGFEMD] [AC:Y15477] [PN : phosphoglucosamine -mutase] 
[GN:glmM] [OR : Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus argl, 
glmM genes and ORF1 and ORF2 . ] [LE:2968] [RE: 4323] [DI: direct] 

>gp: [GI:e283110:gl684749] [LN : SAFEMD] [AC:Y09570] [GN:femD] [OR : Staphylococcus aureus] 
[DB: genpept-bctl] [DE:S. aureus femD gene.] [LE:511] [RE:1866] [DIrdirect] 



ORF Name 



NTID AAID 



IAI7B03000979 2775452 cl 394 



F5~ 



NT AA 
LengthLength 



score 



TTT 



probability 
5.2e-30 



Description 

gp: [GI:el340259:g3850849] [LN : SAU16431] [AC:Y16431] [PN : hypothetical protein] [GN:ORF4] 
[OR staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus dp j , air genes, 
partial kdpC gene and 40RF's.] [LE:2477] [RE: 2971] [DI: direct] 



ORF Name 



x-rm-r^ _ NT AA 

NTID AAID — . — , . score 
LengthLength 



probability 



[AI7503000979 2925275 tl 20 



j [3859 |p^T 1 



Description 
NO-HIT 



ORF Name 



AI7B03000979_293073i2_c3_470 

Description 

NO-HIT 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



J [88 | [3850 ||155 | [54 ~[ 
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ORF Name 



NT ID 



NT AA 

^ Le ^thLe^th ^^ probability 



1AI7503000979 2%9b2b2 c3 49 7 



] 1" | | 3861 II 1098 IP 65 II 665 p.be-fiB 



Description 

sp: [LN:YWLCJBACSU] [AC:P39153] [GN : YWLC : IPC-29D] [OR : BACILLUS SUBTILIS] 
[DE: HYPOTHETICAL 37.0 KD PROTEIN IN SPOIIR-GLYC INTERGENIC REGION] [SP:P39153] 
[DBrswissprot] >pir : [LN: 140476] [AC : 140476 : H70061 : S49358 ] [PN : conserved hypothetical 
protein ywlC:SUA5 homolog ipc-29d] [GN : ywlC : ipc-29d ] [CL:Bacillus subtilis conserved 
hypothetical protein ywlC] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : g556881] 
[LN:BSSPORUPP] [AC:Z38002] [PN: Similar to Saccharomyces cerevisiae SUA5] [GN:ipc-29d] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis spoII-R, glyC and upp genes.] 
[SP:P39153] [LE:1927] [RE:2967] [DI:direct] >gp : [GI : ell84601 :g2636220] [LN: BSUB0019] 
[AC:Z99122:AL009126] [GN:ywlC] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT: alternate gene name: ipc-29d; similar to] [SP:P39153] [LE: 194934] [RE: 195974] 
[DI: complement] >gp : [GI : el 184601 :g2 636220] [LN: BSUB0019] [AC : Z99122 : AL009126] [GN:ywlC] 
[FN: unknown] [OR: Bacillus subtilis] [DB:genpept] [DE: Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: ipc-29d; similar 
to] [SP:P39153] [LE: 194934] [RE: 195974] [DI : complement] 



ORF Name 



IAI7503000979 29979407 c2 410 



Description 
NO-HIT 



NT ID 



^ m NT AA 

AAID — ^ UT — , . score 

LengthLength 



[3862 



1189 I [57 



probability 



ORF Name 



NT ID 



„ TT% NT AA 

AAID — — _ score 

LengthLength 



probability 



^17503000979 30682816 cl 367 



] |91 | [3863 ||537 | [178 | [77r~||l . le-34 



Description 

Sp : [ LN : AT P F_B ACME ] [AC:P20601] [GN : ATPF] [OR:BACILLUS MEGATERIUM] [EC : 3 . 6 . 1 . 34 ] [DE : ATP 
SYNTHASE B CHAIN,] [SP:P20601] [DB : swissprot] >pir : [LN:D31482] [AC:D31482 ] 
[PN:H+-transporting ATP synthase, chain b] [CL :H+- transporting ATP synthase chain I] 
[OR:Bacillus megaterium] [EC : 3 . 6 . 1 . 34] [DB:pir2] >gp : [GI :gl42557] [LN : BACATPA] 
[AC:M20255: J04455 : M18352 :M23924] [PN:ATP synthase b subunit] [OR:Bacillus megaterium] 
[SR:B. megaterium (QM B1551) DNA, clones pWSBlOO, pCAH1.3, and pWPC208] 
[DB :genpept-bctl] [DE:B. megaterium ATP synthase i , a, c , b, delta, alpha, gamma, beta 
andepsilon subunit genes, complete cds, and ORF.] [LE:1777] [RE: 2295] [DI: direct] 



ORF Name 



_ _ ^ NT AA 
NTID AAID — — score 
LengthLength 



E 



17503000979 31637 c2 406 



] [92 [ [3864 |pT~-||296 ||T77T 



probability 
4.3e-125 



Description 

gp: [GI:el352471:g3892893] [LN : SAARGFEMD] [AC:Y15477] [PN : hypothetical protein] 
[OR staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus argl, glmM genes 
and ORF1 and ORF2 . ] [NT:orfl] [LE:1198] [RE: 2007] [DI: direct] 



104 



NT AA , , . , . 

ORF Name NTID AAID Le ^ t frLength score probability 



AI 7 5030009 7 9_:43412800_c2_429 | | 93 | [ 3865 |p^~[ | p5~ || i . ^e-08 



Description 

pir : [LN:B69595] [AC:B69595 ] [PN : spermine /spermidine acetyltransf erase bltD] [GNibltD ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013033 : gl303698] [LN:BACJH642] 
[AC:D84432 :D82370] [PN : BltD] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DErBacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE:811] [RE: 1269] [DI : complement] 

>gp: [GI :ell83889:g2 635105] [LN : BSUB0014] [AC : Z99117 : AL009126] [PN: spermine/spermidine 
acetyltransferase] [GN : bltD] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:2.3.1.-] 
[DE:Bacillus subtilis complete genome (section 14 of 21): from 2599451to 2812870.] 
[NT alternate gene name: bmr2D, bmtD] [SP:P39909] [LE:118163] [RE:118621] [DI:direct] 



ORF Name 


NTID 


7v7VTr , NT AA 

AAID — 

Length Length 


score 


probability 


AI7503000979_3361326_c2_403 


II 94 


| |3866 ||207 | |58 






Description 
NO-HIT 










ORF Name 


NTID 


NT AA 

AAID 

LengthLength 


score 


probability 


AI7503000979_33673776_c2_435 


IP 


|3867 297 1 198 j 


pu f 


.4e-28 | 



Description 

gp: [GI:g4193373] [LN : AF072894] [AC : AF072894] [PN : ribosomal protein L31] [GN: rpmE] 
[OR: Listeria monocytogenes] [DB : genpept-bct2] [DE: Listeria monocytogenes 4bl putative 
transcription terminator Rho(rho) gene, partial cds; and wall teichoic acid 
glycosylationprotein GtcA (gtcA) and ribosomal protein L31 (rpmE) genes , complete cds.] 
[LE:848] [RE:1093] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength 



IAI7503000979 337bl260 cl 372 1 |5B 1 [3868 | |471 | | i56 ||" 448 | | 2.i>e-42 



Description 

pir: [LN:D70065] [AC:D70065 ] [PN: (3R) -hydroxymyristoyl- [acyl carrier protein] 
dehydratase, ywpB] [GN:ywpB ] [CL : (3R) -hydroxymyristoyl - [acyl carrier protein] 
dehydratase] [OR:Bacillus subtilis] [EC: 4.2.1.-] [DB:pir2] >gp : [GI : ell84543 :g2636162] 
[LN:BSUB0019] [AC : Z99122 : AL009126] [GN:ywpB] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 19 of 21) : from 
3597091to 3809700.] [NT: similar to hydroxymyristoyl- (acyl carrier protein)] [LE: 145724] 
[RE:146122] [DI : complement ] >gp : [GI : e289141 :gl763703] [LN :BSZ83337] [AC:Z83337] 
[GN:ywpB] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis mbl, flh[0,P], rapD, 
ywp[B,C,D,E,F,G,H, I, J] and ywqAgenes . ] [NT:similar to hydroxymyristoyl- (acyl carrier 
protein)] [LE:3199] [RE:3597] [DI:direct] >gp : [GI : ell84543 :g2636162] [LN : BSUB0019] 
[AC:Z99122:AL009126] [GN :ywpB] [FN:unknown] [OR:Bacillus subtilis] [DB:genpept] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT: similar to hydroxymyristoyl- (acyl carrier protein)] [LE: 145724] [RE: 146122] 
[DI : complement] 

NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength 



AI7503000979_338703i2_c3_522 | |97 | [ 3859 |[7^"| | 2^3 | pTT] [2 . le-123 



Description 

gp: [GI:e279935:gl934991] [LN : SASIGFACB] [AC:Y07645] [PN:sigma factor B] [GN: sigB] 
[OR staphylococcus aureus] [DB : genpept-bctl] [DE:S. aureus sigB gene.] [LE:2674] 
[RE:3444] [DI: direct] >gp : [GI : e285000 : gl729797] [LN:SAUSIGB] [AC:Y09929] [PN:sigma-B] 
[GN: sigB] [OR : Staphylococcus aureus] [DB : genpept-bctl] [DE:S. aureus rsbU, rsbV, rsbW & 
sigB genes.] [LE:2687] [RE:3457] [DI:direct] 
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ORF Name 



NT ID 



NT AA 
AAID — « _ — , , score 
LengthLength 



probability 



1AI 7 503000979 340k2928 c3 521 



] I 95 I P 870 ||10B5"]|360 |p?T]|7.7e-135 



] 



Description 

gp: [GI:el340261:g3850851] [LN: SAU16431] [AC:Y16431] [PNralr protein] [GN:alr] 
[OR: Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus dp j , air genes, 
partial kdpC gene and 40RF's.] [LE:3400] [RE: 4548] [DI: direct] 



ORF Name 



NTID 



„ TO NT AA 

AAID — _ — . , score 

LengthLength 



IAI7503000979 34181277 c2 451 



[3571 



probability 
1.5e-26 



Description 

gp: [GI :el340257:g3850847] [LN : SAU16431] [AC:Y16431] [PN : hypothetical protein] [GN: ORF2 ] 
[OR: Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus dp j , air genes, 
partial kdpC gene and 40RF's.] [LE:438] [RE: 917] [DI: direct] 



ORF Name 



^17503000979 34187702 c2 407 



NTID 



purr 



_ _ _ _ NT AA 

AAID — _ T — , , score 

LengthLength 



probability 



[3872 



1936 I [ITT 



]EZH 



1173 l|3.7e-119 



Description 

gp: [GI:el352472:g3892894] [LN : SAARGFEMD] [AC:Y15477] [PN : hypothetical protein] 
[GN:orf2] [OR staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus argl, 
glmM genes and ORF1 and ORF2 . ] [LE:2009] [RE: 2941] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — - — . , score 
LengthLength 



probability 



^17503000979 34197275 12 176 



2 [101 | [3873 |p^~l |41 | 



Description 
NO-HIT 



ORF Name 



NTID 



„ Tr , NT AA 

AAID — — .score 

LengthLength 



^17503000979 34250327 ci 395 



probability 
|4.5e-52 



Description 

gp: [GI:e279931:gl934987] [LN : SASIGFACB] [AC:Y07645] [GN:ORFl] [OR : Staphylococcus 
aureus] [DB : genpept-bctl] [DE:S. aureus sigB gene.] [LE:72] [RE:434] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — _ — , score 
Length Length 



probability 



|A17503000979_34589010_cl_348 

Description 

NO-HIT 



J |103 | [3875 1|81 | 



106 



NT AA 

QRF Name MI2 Le^thLe^th ^^ probability 



|AI7503000979_J4bll067_i3^302 | |104 | [3876 ||540 1 |179 ||145 |p.2e-llT 

Description 

sp: [LN:YWJG_BACSU] [AC:P06629] [GN: YWJG] [ORrBACILLUS SUBTILIS] [DE : HYPOTHETICAL 19.1 

KD PROTEIN IN SPOOF-PYRG INTERGENIC REGION (ORFS) ] [SP:P06629] [DB : swissprot] 
>pir : [LN:I40471] [AC : 140471 : S55424 : E7006 0 ] [PN : hypothetical protein ywjGrspoOF 

protein] [GN : ywj G : spoOF ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI :g853763] 
[LN:BSDNA320D] [AC:Z49782] [GN : ywjG] [ORrBacillus subtilis] [DB r genpept-bctl] 
[DE:B. subtilis chromosomal DNA (region 320-321 degrees).] [SP:P06629] [LErlll69] 
[RE:11690] [DI : complement] >gp : [GI : g40177] [LN:BSSPOO] [AC : V00105 : JO 154 9] [GN:spoOF] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis gene required at an 

early stage of sporulation. (gene code spoOF).] [SP:P06629] [LE:541] [RE:1062] 
[DIrdirect] >gp : [GI :ell84620:g2 636239] [LN: BSUB0019] [AC : Z99122 : AL009126] [GN : ywjG] 
[FN:unknown] [ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis complete 

genome (section 19 of 21) r from 3597091to 3809700.] [SPrP06629] [LEr212053] [REr212574] 
[DIrdirect] >gpr [GI rell86215:g2 636251] [LN r BSUB0020] [AC : Z99123 r AL009126] [GNrywjG] 
[FNrunknown] [ORrBacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete 

genome (section 20 of 21) r from 3798401to 4010550.] [SPrP06629] [LErl0743] [RErll264] 
[DIrdirect] >gpr [GI rell84620rg2 636239] [LN r BSUB0019] [AC r Z99122 r AL009126] [GNrywjG] 
[FNrunknown] [ORrBacillus subtilis] [DBrgenpept] [DErBacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [SPrP06629] [LEr212053] [REr212574] 
[DIrdirect] 

NT AA 

QRF Name NT ID A AID — _ — _ score probability 
LengthLength 

&I7S03000979_3461B700_t2J.93 | |105 | [3877 ||5T5— ] |272 | 



155 ||2.8e-08 



Description 

gpr [GI rgl293846] [LN r CELC42D8] [ACrU56966] [GNrC42D8.3] [OR r Caenorhabditis elegans] 
[SRrCaenorhabditis elegans strain^Bristol N2] [DB r genpept - invl ] [DE : Caenorhabditis 
elegans cosmid C42D8.] [NTrcoded for by C. elegans cDNA yk30b3.5; coded for by] 
[LE r 8 907 r 93 77 r 9844 r 10820] [RE r 9056 r 9564 r 102 06 r 10976 ] [DI r direct Join] 



NT AA 

QRF Name NT ID AAID — _^ — _ score prob ability 
LengthLength 



[AI7503000579_34646926_t2_20g | [106 | [3878 | 243 | |555 |[77S e-i>7 

Description 

Sp r [LN r DEOD_ACTPL] [ACrP94164] [GNrDEOD] [OR r ACTINOBACILLUS PLEUROPNEUMONIAS] 
[SRr , HAEMOPHILUS PLEURO PNEUMONIAE] [ECr2.4.2.1] [DE r (PNP) ] [SPrP94164] [DB r swissprot] 
>gpr [GI rgl732037] [LN r APU55016] [ACrU55016] [PNrpurine nucleoside phosphorylase] 
[GNrdeoD] [FNr cleavage of guanosine or inosine to respective] [OR : Actinobacillus 
pleuropneumoniae] [DB rgenpept-bctl] [ECr 2 .4.2.1] [DE : Actinobacillus pleuropneumoniae 
heat-shock 10 protein GroES (mopB) , heat- shock 60 protein GroEL (mopA) , purine 
nucleoside phosphorylase (deoD) genes, complete cds, alcohol dehydrogenase (adhE) 
gene, partial cds.] [LErl76] [REr898] [DIrdirect] 

NT AA 

QRF Name NTID AAID — _ „. — _ score probabi lity 
LengthLength J ~ 



AI7503000979_353546E;6_rl_108 | |107 | [3879 | p^~| |7 B | 



Description 
NO-HIT 



107 



NT AA 

ORF Name NTID AAI5 Le ^thLe^th ^^ probability 

|AI7503000979_3b64778J_ti_103 | [108 | [ 3880 |p3~| |229 | pi~ ][ 4 . 9e-14 J 

Description 

pir: [LN:F71082] [AC:F71082 ] [PN : hypothetical protein PH0924] [GN:PH0924 ] 
[OR : Pyrococcus horikoshii] [DB:pir2] >gp : [GI :dl030963 :g3257337] [LN: AP000004] 
[AC:AP000004 :AB0094 94 :AB0094 95 :AB009496 : AB0094 97 : AB0094 98 : AB0094 99] [PN: 12 8aa long 

hypothetical protein] [GN:PH0924] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain :OT3) DNA] [DB : genpept - be t 1 ] [DE : Pyrococcus horikoshii OT3 genomic DNA, 

777001-994000 nt. position (4/7 ). ] [LE:55768] [RE:56154] [DI:direct] 



NT AA 

ORF Name NT ID AAID — — t , score probability 
— — — Length Length ^ — — 



AI7503000979_36i28785_cl_3(;9 | [109 | [3881 | [535~|| 295 | |796 | | 3.3e-79 



Description 

Sp : [ LN : ATPG_BACME ] [AC:P20602] [GN : ATPG] [OR:BACILLUS MEGATERIUM] [EC : 3 . 6 . 1 . 34] [DE : ATP 
SYNTHASE GAMMA CHAIN, ] [SP:P20602] [DB : swissprot] >pir : [LN : G31482] [AC:G31482 ] 
[PN:H+-transporting ATP synthase, gamma chain] [CL :H+- transporting ATP synthase gamma 
chain] [OR : Bacillus megaterium] [EC : 3 . 6 . 1 . 34] [DB:pir2] >gp : [GI : gl42560] [LN : BACATPA] 
[AC:M20255: J04455 :M18352 :M23924] [PN:ATP synthase gamma subunit] [OR:Bacillus 
megaterium] [SR : B .megaterium (QM B1551) DNA, clones pWSBlOO, pCAHl . 3 , and pWPC208] 
[DB:genpept-bctl] [DE:B. megaterium ATP synthase i , a, c, b, delta, alpha, gamma, beta 
andepsilon subunit genes, complete cds, and ORF.] [LE:4467] [RE:5324] [DI:direct] 



ORF Name 


NT AA 
NT ID AAID — , — , score 
LengthLength ■ 


probability 


AI7S03000979_3S225052_c3_502 


| |110 | |3882 |234 ||77 ||270 ||1 


.Ue-23 | 



Description 

pir : [LN: 139786] [AC:I39786 ] [PN:H+- transporting ATP synthase, C chain] [GN: atpE ] 
[CL:H+- transporting ATP synthase lipid-binding protein] [OR:Bacillus firmus] 
[EC:3.6.1.34] [DB:pir2] >gp : [GI : gl42570] [LN : BACATPSYNB] [AC:M84713] [PN: ATP synthase 
subunit] [GN: atpE] [FN:proton translocation] [OR:Bacillus firmus] [SR:Bacillus firmus 
DNA] [DB : genpept -bet 1] [DE:Bacillus firmus ATP synthase a and c subunit genes, 3' end 
andcomplete cds.] [NT : putative] [LE:299] [RE: 511] [DI: direct] 

NT AA 

ORF Name NT ID AAID — _ — , score probability 
LengthLength c *- 

[&I7503000579_36363432_tl_95 | [ 111 | [3883 | p2~ | p — 

Description 
NO-HIT 



ORF Name 



NT 



AA 



NT ID AAID — . 



LengthLength 



score 



[ttt 



[3884 1|55 Im- 



probability 




AI7503000979_4039012_c3_514 

Description 

gp: [GI:g5410434] [LN : AF134170] [AC : AF134170] [PN:gag] [OR : Dictyostelium discoideum] 
[DB: genpept] [DE : Dictyostelium discoideum retrotransposon TRE3-B, complete sequence.] 
[LE:139] [RE: 1347] [DI: direct] 
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ORF Name 



NTID 



NT 



AA 



AAID „ m _ _ 

LenqthLength 



score 



probability 



1AI 7 503000979 406^20 2 c3 499 



] 1 | 3885 |P 3 9||412 ||ib63 |[1.8e-lW 



1 



Description 

sp: [LN:GLYA_BACSU] [AC:P39148] [GN : GLYA : GLYC : IPC-34D] [OR : BACILLUS SUBTILIS] 
[EC:2. 1.2.1] [DE : (SHMT) ] [SP:P39148] [DB : swissprot] >pir : [LN: 140483] 
[AC : 140483 :H69635 : S49363 ] [PNrglycine hydroxymethyl trans f erase , glyA: serine 
hydroxymethyltransf erase] [GN:glyA:glyC ] [CL: glycine hydroxymethyl transferase] 
[OR:Bacillus subtilis] [EC:2. 1.2.1] [DB:pir2] >gp : [GI :g556886] [LN : BSSPORUPP] 
[AC:Z38002] [PN: serine hydroxymethyltransf erase] [GN:glyC] [OR: Bacillus subtilis] 
[DB: genpept-bctl] [DE : B . subtilis spoII-R, glyC and upp genes.] [SP:P39148] [LE:5499] 
[RE:6746] [DI: direct] >gp : [GI:ell84596:g2 636215] [LN: BSUB0019] [AC : Z99122 : AL009126] 
[PN: serine hydroxymethyltransf erase] [GN : glyA] [FN : glycine/serine/threonine metabolism] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC:2. 1.2.1] [DE:Bacillus subtilis complete 
genome (section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: glyC, 
ipc-34d] [SP:P39148] [LE:191155] [RE:192402] [DI : complement] >gp: [GI : ell84596 :g2636215] 
[LN:BSUB0019] [AC : Z99122 : AL009126] [PN: serine hydroxymethyltransf erase] [GN : glyA] 
[FN: glycine/serine/threonine metabolism] [OR:Bacillus subtilis] [DB:genpept] 
[EC:2. 1.2.1] [DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 
3809700.] [NT:alternate gene name: glyC, ipc-34d] [SP:P39148] [LE:191155] [RE:192402] 
[DI complement] 



ORF Name 



NT AA 
NTID AAID — .„ — , , score 
LengthLength 



probability 



F 



7503000979 40712 ±2 210 



Description 
NO-HIT 

ORF Name 



] |114 | [3886 |pT5~||71 | 



NTID 



7V7VTT> NT AA 

AAID — , -i _. — . , score 

LengthLength 



probability 



|AI7S03000979_4072I3S_c3_490 



][ 



TIT 



[3887 |pT^T]|488 ||870 ||4.8e-87 



Description 

pir: [LN:E70961] [AC:E70961 ] [PN : hypothetical protein Rv0223c] [GN:Rv0223c ] 
[OR Mycobacterium tuberculosis] [DB:pir2] >gp : [GI : e304956 :gl871596] [LN :MTCY8D5] 
[AC:Z92669:AL123456] [PN : hypothetical protein Rv0223c] [GN:Rv0223c] [OR Mycobacterium 
tuberculosis] [DB : genpept-bctl] [DE : Mycobacterium tuberculosis H3 7Rv complete genome; 
segment 12/162.] [NT:Rv0223c / (MTCY08D5 . 18 ) , aldehyde dehydrogenase,] [LE:18979] 
[RE : 2 0442] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — - — score 
LengthLength 



probability 



IAI7503000979 4079511 tl 3 



] | [3888 ||147 1 148 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



E 



17503000979 409555 i± 111 



TT7" 



NT AA 
LengthLength 



score 



44 



TIT 



probability 



Description 

pir: [LN:D71245] [AC:D71245 ] [PN : hypothetical protein PH0221] [GN:PH0221 ] 
[OR : Pyrococcus horikoshii] [DB:pir2] >gp : [GI : d!030234 :g3256608] [LN: AP000001] 
[AC:AP000001 :AB009465 :AB009464 :AB009466 :AB00946 7 :AB009468 :AB00946 9] [PN:235aa long 

hypothetical protein] [GN:PH0221] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB : genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 1-287000 

nt. position (1/7).] [LE:194212] [RE:194919] [DI : complement] 
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NT AA 

ORF Name NTID AAID Le ^ t h Le ^ tn SCQr e probability 



AI7503000979_41782m_cl_J5J | | 118 | [ 3890 | pT^ | 224 [pi || 0.0055 



Description 

gp: [GI:gll31502] [LN: PBU42580] [AC : U4 2 5 8 0 : Ul 7055: U3 2570] [GN:A158L] [OR : Paramecium 
bursaria Chlorella virus 1] [DB : genpept - vrl ] [DE : Paramecium bursaria Chlorella virus 1, 
complete genome.] [LE : 80789] [RE: 81103] [DI : complement] 

NT AA 

ORF Name NTID AAID — . „ — , , score probability 
LengthLength c L - 

|119 [ [3891 |[m~||40 | 



H7S03000979 434S926 c2 450 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 
|AI7503000979_ 4728558 c2 409 1 1 [ 3892 | [r^TT|| 519 |p88 | | 4 . le-216 



Description 

pir: [LN:B69633] [AC:B69633 ] [PN : L-glutamine-D- fructose- 6 -phosphate amidotransf erase 
glmS] [GN:glmS ] [CL :glutamine- -f ructose- 6 -phosphate aminotransferase (isomerizing) ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl034037 :g3599596] [LN : AB006424] [AC : AB006424] 
[PN:L-GLUTAMINE-D-FRUCTOSE- 6 -PHOSPHATE AMIDO] [GN:gcaA] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain:168) DNA] [DB : genpept-bctl] [DErBacillus subtilis genomic 
DNA, 70 kb region between 17 and 23degree.] [LE:3246] [RE: 5048] [DI: direct] 
>gp: [GI:g726480] [LN : BSU21932 ] [AC : U21932 : D21198 ] 

[PN: L-glutamine-D- fructose -6 -phosphate] [GN:gcaA] [OR: Bacillus subtilis] 
[DB : genpept-bctl] [DE :Bacillus subtilis 

L-glutamine-D-f ructose-6-phosphateamidotransf erase (gcaA) gene, complete cds.] [LE:312] 
[RE:2114] [DI: direct] >gp : [GI:ell82111:g2632445] [LN : BSUB0001] [AC : Z99104 : AL009126 ] 
[PN : L-glutamine-D- fructose- 6 -phosphate] [GN : glmS] [OR : Bacillus subtilis] 
[DB:genpept-bctl] [EC : 2 . 6 . 1 . 16] [DE:Bacillus subtilis complete genome (section 1 of 

21): from 1 to213080.] [NT : alternate gene name: gcaA, ybxD] [SP:P39754] [LE:200263] 
[RE:202065] [DI:direct] >gp : [GI :ell82129:g2 632463] [LN: BSUB0002] [AC : Z99105 : AL009126] 
[PN: L-glutamine-D- fructose -6 -phosphate] [GN:glmS] [OR: Bacillus subtilis] 
[DB: genpept-bctl] [EC : 2 . 6 . 1 . 16] [DE:Bacillus subtilis complete genome (section 2 of 

21): from 194651 to415810.] [NT : alternate gene name: gcaA, ybxD] [SP:P39754] [LE:5613] 
[RE : 7415] [DI : direct] 



ORF Name 


NT AA 

NTID AAID . — . , , — . , score probability 
LenyLh Length ^ *- 


|A17!i03000979_476567_cl_355 


121 |3893 ||567 1 1188 292 8.5e-25 | 



Description 

sp: [LN:RPOE_BACSU] [AC:P12464] [GN : RPOE] [OR:BACILLUS SUBTILIS] [EC:2.7.7.6] 
[DE: DNA- DIRECTED RNA POLYMERASE DELTA SUBUNIT, ] [SP:P12464] [DB : swissprot] 
>pir: [LN: JT0302] [AC : JT0302 :H32354 : S55422 :H69698 ] [PN : DNA- directed RNA polymerase, 
delta chain rpoE] [GN: rpoE ] [CL : DNA-directed RNA polymerase delta chain] [OR: Bacillus 
subtilis] [EC:2.7.7.6] [DB:pirl] >gp : [GI :gl43456] [LN: BACRPOE] [AC:M21677] [OR:Bacillus 
subtilis] [SR:B. subtilis (strain 168) DNA, clone mMLll] [DB : genpept-bctl] 
[DE :B. subtilis RNA polymerase delta subunit (rpoE) gene, complete cds.] [NT : rpoE 
protein (ttg start codon) ] [LE:101] [RE:622] [DI:direct] >gp : [GI :g853761] 
[LN : BSDNA32 0D] [AC:Z49782] [PN : RNA polymerase delta subunit] [GN : rpoE] [OR:Bacillus 
subtilis] [DB : genpept-bctl] [DE : B . subtilis chromosomal DNA (region 320-321 degrees).] 
[NT: ttg start codon] [SP:P12464] [LE:8717] [RE:9238] [DI:direct] 

>gp: [GI:ell86217:g2636253] [LN : BSUB0020] [AC : Z99123 : AL009126] [PN: RNA polymerase (delta 
subunit)] [GN: rpoE] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:2.7.7.6] [DE:Bacillus 
subtilis complete genome (section 20 of 21): from 3798401to 4010550.] [SP:P12464] 
[LE : 13195] [RE : 13716] [DI : complement] 



110 



ORF Name 

A17503000979_48fc9213_c3_bi9 



NTID 



NT AA 

^ Le^thLe^th ^ probability 



] I 122 | | 3flS4 I F^HP 6 |pn~|| i.4e-128 



Description 

gp: [GI:el340258:g3850848] [LN: SAU16431] [AC:Y16431] [PN : hypothetical protein] [GN:0RF3] 
[OR: Staphylococcus aureus] [DB :genpept-bctl] [DE : Staphylococcus aureus dp j , air genes, 
partial kdpC gene and 40RF's.] [LE:910] [RE: 2493] [DI: direct] 



ORF Name 



[AJ7503000979 4881252 t3 293 



NTID 



TTT 



7V7VTT ^ NT AA 

AAID — « — , , score 

LengthLength 



probability 



Description 
NO-HIT 



|3895 |[T7I | 



[FT 



ORF Name 



IAI7503000979 4881302 t2 191 



NTID 



Description 
NO-HIT 



NT AA 
AAID — 4 _, T — . score 
LengthLength 

|3896 |pTH"D 1 149 | 



probability 



ORF Name 



NTID 



NT 



AA 



AAID — , 

LengthLength 



score 



E 



17503000979 4901712 ci 370 



T7T 



[3897 |p£35~| |575~ 



probability 
i.ie-20S 



Description 

sp: [LN:ATPB_BACSU] [AC:P37809] [GN : ATPD] [OR:BACILLUS SUBTILIS] [EC : 3 . 6 . 1 . 34] 
[DE: (VEG31) ] [SP:P37809] [DB : swissprot] >pir : [LN : 140368] [AC : 14 036 8 :H69591 : S3 92 56 ] 
[PN:H+- transporting ATP synthase, beta chain (atpD) ] [GN: atpD ] [CL :H+- transporting 
ATP synthase alpha chain: H+- transporting ATP synthase alpha chain homology] 
[OR:Bacillus subtilis] [EC: 3 . 6 . 1 . 34] [DB:pir2] >gp : [GI :g433991] [LN : BSATPASE] 
[AC:Z28592] [PN:ATP synthase subunit beta] [GN: atpD] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis (168) atpase genes for ATP synthase subunits i, a, c 
,b, delta, alpha, gamma, beta, epsilon.] [SP:P37809] [LE:5520] [RE:6941] [DI:direct] 
>gp : [GI : ell84587 : g2636206] [LN : BSUB0019] [AC : Z99122 : AL009126 ] [PN:ATP synthase (subunit 
beta)] [GN:atpD] [OR: Bacillus subtilis] [DB :genpept-bctl] [EC : 3 . 6 . 1 . 34] [DE:Bacillus 
subtilis complete genome (section 19 of 21): from 3597091to 3809700.] [SP:P37809] 
[LE:183456] [RE:184877] [DI : complement] >gp : [GI :ell84587:g2 636206] [LN : BSUB0019] 
[AC:Z99122:AL009126] [PN:ATP synthase (subunit beta)] [GN : atpD] [OR:Bacillus subtilis] 
[DB:genpept] [EC : 3 . 6 . 1 . 34] [DE:Bacillus subtilis complete genome (section 19 of 21): 
from 3597091to 3809700.] [SP:P37809] [LE:183456] [RE:184877] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



IAI7503000979 49S2802 c2 421 



] I 126 | | 389fl |P^|434 |p^T 



score probability 
1. 9e-163 



Description 

gp: [GI :g4336793] [LN: AF105341] [AC : AF105341] [PN:pyrimidine nucleoside phosphorylase] 
[GN:pdpl] [OR:Listeria monocytogenes] [DB : genpept-bct2] [DE:Listeria monocytogenes 
threonine dehydratase (thdl) gene, partialcds; alpha acetolactate decarboxylase gene, 
complete cds; andpyrimidine nucleoside phosphorylase (pdpl) gene, partial cds . ] 
[LE : 1749] [RE : >3010] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ T — , , score probability 
LengthLength c J - 



M7S03000979_S078177_cl_36B | |127 | [3899 || 10g2 ^| 353 |pTgT[| 9. 9e-li9 



Description 

gp: [GI:gl773355] [LN : SAU81973 ] [AC:U81973] [PN:Cap5P] [GN:cap5P] [OR : Staphylococcus 
aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus capsule gene cluster CapSA through 
CapSPgenes, complete cds.] [NT:putative N-acetylglucosamine 2-epimerase] [LE:15769] 
[RE : 16944] [DI : direct] 



111 



ORF Name 



NTID 



NT 

AAID _ 



AA 



LengthLength 



score 



LAI7503000979 511150U c2 454 



] I 128 1 1 3500 II 924 IP 07 IF 17 " 



probability 
4.8e-39 



Description 

sp: [LN:YQJG_BACSU] [AC:P54544] [GN: YQJG] [ORrBACILLUS SUBTILIS] [DE : PRECURSOR] 
[SP:P54544] [DB : swissprot] >pir : [LN: G69963] [AC:G69963 ] [PN: lipoprotein SpoIIIJ-like 
homolog yqjG] [GNryqjG ] [CL: stage III sporulation protein : stage III sporulation 
protein homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013293 :gl303958] 
[LN:BACJH642] [AC :D84432 :D82370] [PN : YqjG] [OR:Bacillus subtilis] [SRrBacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 234919] [RE: 235746] [DI : complement] 
>gp: [GI:ell85657:g2634823] [LN : BSUB0013] [AC : Z99116 : AL009126] [GN:yqjG] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [NTrsimilar to lipoprotein SpoIII J-like] 
[SP:P54544] [LE:87873] [RE:88700] [DI:direct] 



ORF Name 



IA17503000979 5131927 ti 93 



Description 
NO-HIT 



_ _i_ _ „ TO NT AA 

NTID AAID — _ T — , , score 
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Description 

Sp : [LN : ATPE_BACSU] [AC:P37812] [GN : ATPC] [OR: BACILLUS SUBTILIS] [EC : 3 . 6 . 1 . 34] [DE : ATP 
SYNTHASE EPSILON CHAIN,] [SP:P37812] [DB : swissprot ] >pir : [LN : 140369] 
[AC:I40369 :G69591 :S39257 ] [PN : H+- transporting ATP synthase, epsilon chain (atpC) ] 
[GN:atpC ] [CL:H+- transporting ATP synthase epsilon chain] [OR: Bacillus subtilis] 
[EC:3.6.1.34] [DB:pir2] >gp : [GI : g433 992 ] [LN: BSATPASE] [AC:Z28592] [PN: ATP synthase 
subunit epsilon] [GN:atpC] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis 
(168) atpase genes for ATP synthase subunits i, a, c ,b, delta, alpha, gamma, beta, 
epsilon.] [SP:P37812] [LE:6965] [RE:7363] [DI:direct] >gp : [GI : ell84586 :g2636205] 
[LN:BSUB0019] [AC : Z 99122 : AL009126] [PN: ATP synthase (subunit epsilon) ] [GN:atpC] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC: 3.6.1. 34] [DErBacillus subtilis complete 
genome (section 19 of 21): from 3597091to 3809700.] [SP:P37812] [LE:183034] [RE:183432] 
[DI: complement] >gp: [GI :el!84 586 :g2 636205] [LN : BSUB0019] [AC : Z99122 : AL009126] [PN:ATP 
synthase (subunit epsilon)] [GN:atpC] [OR:Bacillus subtilis] [DB:genpept] [EC: 3 . 6 . 1 . 34] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[SP:P37812] [LE:183034] [RE:183432] [DI : complement] 
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Description 

gp: [GI:el340262:g3850852] [LN: SAU16431] [AC:Y16431] [PN : hypothetical protein] [GN : ORF7] 
[OR staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus dp j , air genes, 
partial kdpC gene and 40RF's.] [LE:4633] [RE:>4735] [DI: direct] 
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Description 

Sp : [LN: THID_HAEIN] [AC:P44697] [GN: THID : HI 04 16] [OR : HAEMOPHILUS INFLUENZAE] 
[EC:2.7.4.7] [DE:(HMP-P KINASE)] [SP:P44697] [DB : swissprot] >pir : [LN : 164151] [AC:I64151 
] [PN: hypothetical protein HI0416] [CL : phosphomethylpyrimidine phosphate kinase] 
[OR: Haemophilus influenzae] [DB:pir2] >gp : [GI :gl573390] [LN:U32725] [AC :U32725 : L42023] 
[PN: phosphomethylpyrimidine kinase (thiD)] [GN:HI0416] [OR Haemophilus influenzae Rd] 
[DB :genpept-bct2] [DE Haemophilus influenzae Rd section 40 of 163 of the complete 
genome.] [NT:similar to GB:AE000511 PID:2313975 percent] [LE:1472] [RE:2281] 
[DI:direct] 
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Description 

sp : [LN : ALF1_BACSU] [AC:P13243] [GN : FBAA : FBA : FBA1 : TSR] [OR:BACILLUS SUBTILIS] 
[EC:4.1.2.13] [DE : PROBABLE FRUCTOSE -BISPHOSPHATE ALDOLASE 1,] [SP:P13243] 
[DB: swissprot] >pir: [LN:D32354] [AC : S55426 :D32354 : E32354 :D41835 : B69621 ] 
[PN: f ructose-bisphosphate aldolase, fbaA:3 0K phosphoprotein 

orfY-tsr : f ructose-1, 6-bisphosphate aldolase] [GN:fbaA ] [CL : f ructose-bisphosphate 
aldolase II] [OR:Bacillus subtilis] [EC : 4 . 1 . 2 . 13] [DB:pirl] >gp : [GI :g460911] 
[LN:BACSPO0FA] [AC:M22039] [PN: f ructose-bisphosphate aldolase] [GN:orfY-tsr] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strains JH642 and UOT0550) DNA] 
[DB:genpept-bctl] [EC : 4 . 1 . 2 . 13] [DE:Bacillus subtillis spoOF, CTP synthetase (ctrA) , 
andf ructose-bisphosphate aldolase (orfY-tsr) genes, complete cds . ] [LE:3270] [RE:4127] 
[DI:direct] >gp : [GI : g853765] [LN: BSDNA32 0D] [AC:Z49782] [PN:fructose biphosphate 
aldolase] [GN:tsr] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis chromosomal 
DNA (region 320-321 degrees) . ] [SP:P13243] [LE:12411] [RE:13268] [DI:direct] 
>gp: [GI:ell84618:g2636237] [LN:BSUB0019] [AC : Z99122 : AL009126] 

[PN:fructose-l,6-bisphosphate aldolase] [GN:fbaA] [FN glycolysis] [OR: Bacillus 
subtilis] [DB : genpept-bctl] [EC : 4 . 1 . 2 . 13] [DE: Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: tsr, fba] 
[SP:P13243] [LE:210475] [RE:211332] [DI : complement] >gp: [GI : ell 862 13 :g2 636249] 
[LN:BSUB0020] [AC : Z99123 : AL009126] [PN : fructose- 1 , 6-bisphosphate aldolase] [GN:fbaA] 
[FNiglycolysis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 4 . 1 . 2 . 13] [DE:Bacillus 
subtilis complete genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate 
gene name: tsr, fba] [SP:P13243] [LE:9165] [RE: 10022] [D I : complement] 
>gp: [GI :ell846 18 :g2 636237] [LN:BSUB0019] [AC : Z99122 : AL009126] 

[PN: fructose- 1, 6 -bisphosphate aldolase] [GN:fbaA] [FN glycolysis] [OR: Bacillus 
subtilis] [DB:genpept] [EC : 4 . 1 . 2 . 13] [DE: Bacillus subtilis complete genome (section 19 
of 21): from 3597091to 3809700.] [NT : alternate gene name: tsr, fba] [SP:P13243] 
[LE : 210475] [RE : 211332] [DI : complement] 
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Description 

pir: [LN:A69662] [AC:A69662 ] [PN : UDP-N-acetylglucosamine 1-carboxyvinyl transferase 
murA] [GN:murA ] [CL : UDP-N-acetylglucosamine 1-carboxyvinyl transferase MurZ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e276830 :gl648861] [LN : BSATPC] [AC:Z81356] 
[PN:UDP-N-acetylglucosamine] [GN:murA] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:B. subtilis atpC gene.] [LE:1943] [RE:3253] [DI: direct] >gp : [GI : el!84582 :g2636201] 
[LN:BSUB0019] [AC:Z99122 :AL009126] [PN : UDP-N-acetylglucosamine] [GN:murA] 
[FNrpeptidoglycan biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:2.5.1.7] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[LE:179914] [RE:181224] [DI : complement] >gp : [GI :ell84582 :g2 636201] [LN: BSUB0019] 
[AC: Z99122 :AL009126] [PN : UDP-N-acetylglucosamine] [GN : murA] [FN:peptidoglycan 
biosynthesis] [OR:Bacillus subtilis] [DB:genpept] [EC:2.5.1.7] [DE:Bacillus subtilis 
complete genome (section 19 of 21) : from 3597091to 3809700.] [LE: 179914] [RE: 181224] 
[DI : complement] 
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Description 

sp: [LN:MURF_BACSU] [AC:P96613] [GN:MURF] [OR:BACILLUS SUBTILIS] [EC : 6 . 3 . 2 . 15] 
[DE : (D-ALANYL-D- ALANINE- ADDING ENZYME) ] [SP:P96613] [DB : swissprot] >pir : [LN : F69662] 
[AC: F6 9662 ] [PN :UDP-N-acetylmuramoylalanyl-D-glutamyl-2 , 6 -diaminopimelate-D-al murF] 
[GN:murF ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :dl020047 :gl881267] [LN : AB001488] 
[AC:AB001488] [GN:ydbQ] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] 
[DB:genpept-bctl] [DE: Bacillus subtilis genome sequence, 148 kb sequence of the 
regionbetween 35 and 47 degree.] [NT: PROBABLE UDP -N- ACE TYLMURAMOYLALANYL-D- GLUTAMYL- 2 , ] 
[LE:42447] [RE:43820] [DI: direct] >gp : [GI :ell82423 :g2 632757] [LN : BSUB0003] 
[AC:Z99106 :AL009126] [PN : UDP-N-acetylmuramoylalanyl -D-glutamyl -2 , 6 - ] [GN:murF] 
[FN:peptidoglycan biosynthesis] [OR:Bacillus subtilis] [DB :genpept-bctl] [EC : 6 . 3 . 2 . 15] 
[DE:Bacillus subtilis complete genome (section 3 of 21): from 402751 to611850.] 
[NT:alternate gene name: ydbQ] [SP:P96613] [LE:106191] [RE:107564] [DI: direct] 
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Description 

sp : [LN : SYK_STAAU] [AC:Q53638] [GN : LYSS] [OR : STAPHYLOCOCCUS AUREUS] [EC: 6 . 1 . 1 . 6] 
[DE : LYSYL -TRNA SYNTHETASE, (LYSINE- -TRNA LIGASE) (LYSRS) ] [SP:Q53638] [DB : swissprot] 
>gp: [GI:g567884] [LN : STA5SRR] [AC:L36472] [PN : lysyl - tRNA synthetase] [OR : Staphylococcus 

aureus] [SR : Staphylococcus aureus (clone library: ATCC 12600) DNA] [DB : genpept-bctl] 
[DE : Staphylococcus aureus lysyl- tRNA sythetase gene, complete cds, transfer RNA (tRNA) 

genes, 5S ribosomal RNA (5S rRNA) gene, 16Sribosomal RNA (16S rRNA) gene, 23S ribosomal 

RNA (23S rRNA) gene.] [LE:176] [RE:1663] [DIrdirect] 
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Description 

sp: [LN:YABO_BACSU] [AC:P37557] [GN : YABO] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 9.7 KD 
PROTEIN IN MFD-DIVIC INTERGENIC REGION] [SP:P37557] [DB : swissprot] >pir : [LN : S66089] 
[AC:S66089:F69739 ] [PN : conserved hypothetical protein yabO] [GN:yabO ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005836 :g467448] [LN:BAC180K] [AC:D26185] [PN:unknown] 
[OR:Bacillus subtilis] [SR.: Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB: genpept-bctl] [DE :B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:131477] [RE:131737] [DI:direct] >gp : [GI : ell81992 : g2632326] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [GN:yabO] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [NT:similar 
to hypothetical proteins] [SP:P37557] [LE: 67875] [RE: 68135] [DIrdirect] 
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Description 

sp: [LN:YAAT_BACSU] [AC:P37541] [GN:YAAT] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 31.2 
KD PROTEIN IN XPAC-ABRB INTERGENIC REGION] [SP:P37541] [DB : swissprot] >pir : [LN : S66 062] 
[AC:S66062 :A69738 ] [PN: signal peptidase II homolog yaaT] [GN:yaaT ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005810 : g467422] [LN: BAC180K] [AC:D26185] [PNrunknown] 
[OR: Bacillus subtilis] [SR: Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB:genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:105257] [RE:106084] [DI:direct] >gp : [GI:ell81965:g2 632299] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [GN:yaaT] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [NT:similar 
to signal peptidase II] [SP:P37541] [LE:41655] [RE:42482] [DI: direct] 
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Description 

sp : [LN : KTHY__BACSU] [AC:P37537] [GN: TMK] [OR:BACILLUS SUBTILIS] [EC:2.7.4.9] 
[DE : THYMIDYLATE KINASE, (DTMP KINASE)] [SP:P37537] [DB : swissprot] >pir : [LN: S66058] 
[AC:S66058 :D69724 ] [PN: thymidylate kinase tmk] [GN:tmk ] [CL : dTMP kinase] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005806 : g467418] [LN:BAC180K] [AC:D26185] [PN:unknown] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] 
[DB:genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:102759] [RE:103397] [DI:direct] >gp: [GI :ell81961:g2632295] [LN : BSUB0001] 
[AC:Z99104:AL009126] [PN : thymidylate kinase] [GN:tmk] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC:2.7.4.9] [DE:Bacillus subtilis complete genome (section 1 of 21): 
from 1 to213080.] [NT : alternate gene name: yaaP] [SP:P37537] [LE:39157] [RE:39795] 
[DI:direct] 
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Description 

gp: [GI:dl045277:g5106180] [LN : AP000064] [AC : AP000064] [PN:353aa long hypothetical 
protein] [GN:APE2475] [OR: Aeropyrum pernix] [SR : Aeropyrum pernix (strain:Kl) DNA] 
[DB:genpept] [DE : Aeropyrum pernix genomic DNA, section 7/7.] [NT: similar to 
OWL:AP00000385 percent identity : 66 . 667] [LE:151512] [RE:152573] [DIrdirect] 
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Description 

sp: [LN:HPPK_BACSU] [AC:P29252] [GN : FOLK] [OR:BACILLUS SUBTILIS] [EC:2.7.6.3] [DE : (HPPK) 
( 6 -HYDROXYMETHYL- 7 , 8 -DIHYDRO PTERIN PYROPHOSPHOKINASE) (PPPK) ] [SP:P2 9252] 
[DBrswissprot] >pir : [LN : S66109] [AC : S66109 : F37854 : F69626 ] 
[PN: 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine 

pyrophosphokinase, : 6-hydroxymethyl-7 , 8 -dihydropterin pyrophosphokinase : 7 , 
8-dihydro-6-hydroxymethylpterin pyrophosphokinase] [GN: f olK ] 
[CL: 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine 

pyrophosphokinase : 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 
homology] [OR:Bacillus subtilis] [EC:2.7.6.3] [DB:pir2] >gp : [GI : dl005856 :g467468] 
[LN:BAC180K] [AC:D26185] [PN : 7 , 8 -dihydro- 6 -hydroxymethylpterin-pyrophosphokin] 
[GN:folk] [OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, 
strain: 168) DNA] [DB : genpept-bctl] [DE:B. subtilis DNA, 180 kilobase region of 
replication origin.] [LE:150545] [RE:151048] [DI:direct] >gp : [GI : ell82012 : g2632346] 
[LN:BSUB0001] [AC:Z99104 :AL009126] [ PN : 7 , 8 - dihydro - 6 - hydroxyme thylpter in] [GN : f olK] 
[FN:dihydrofolate biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:2.7.6.3] 
[DE:Bacillus subtilis complete genome (section 1 of 21) : from 1 to213080.] [LE: 86943] 
[RE : 87446] [DI : direct] 
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Description 

pir: [LN:S69873] [AC:S69873 ] [PN : hypothetical protein YML009w-a] [OR : Saccharomyces 
cerevisiae] [DB:pir2] [MP:13L ] 



NT AA 

QRF Name NT ID AAID — — j , score probabi lity 
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Description 

pir: [LN:S66080] [AC : S66080 : 140018 : C69629 : S05371 : S18903 ] [PN : UDP-N-acetylglucosamine 
pyrophosphorylase gcaD:cell division protein tms26:tms protein] [GN :gcaD : tms26 ] 
[CL:N-acetylglucosamine-l -phosphate uridyltransf erase] [OR: Bacillus subtilis] [DB :pir2] 
>gp: [GI:dl005827:g467439] [LN:BAC180K] [AC:D26185] [PN : temperature sensitive cell 
division] [GN:tms26] [OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, 
strain: 168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of 
replication origin.] [LE:119952] [RE:121322] [DI:direct] >gp : [GI : ell81983 : g2632317] 
[LN:BSUB0001] [AC:Z99104 :AL009126] [PN :UDP-N-acetylglucosamine pyrophosphorylase] 
[GN:gcaD] [FN:peptidoglycan and lipopolysaccharide] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [EC : 2 . 7 . 7 . 23] [DE:Bacillus subtilis complete genome (section 1 of 
21): from 1 to213080.] [NT : alternate gene name: tms, tms26] [SP:P14192] [LE:56350] 
[RE:57720] [DI:direct] 

NT AA 
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Description 

sp: [LNrYACC_BACSU] [AC:P37565] [GN : YACC] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 31.8 
KD PROTEIN IN FTSH-CYSK INTERGENIC REGION] [SP:P37565] [DB r swissprot] >pir : [LN : S66101] 
[AC:S66101:F69740 ] [PN : conserved hypothetical protein yacC] [GN:yacC ] [CL : conserved 
hypothetical protein slll988] [ORrBacillus subtilis] [DB:pir2] 

>gp: [GI:dl005848:g467460] [LN:BAC180K] [AC:D26185] [PNrunknown] [ORrBacillus subtilis] 
[SRrBacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB : genpept - be t 1 ] [DE : B . 
subtilis DNA, 180 kilobase region of replication origin.] [LE: 143479] [RE: 144354] 
[DI:direct] >gp : [GI : ell82004 :g2632338] [LN : BSUB0001] [AC : Z99104 : AL009126] [GN:yacC] 
[FNrunknown] [ORrBacillus subtilis] [DB : genpept -bctl] [DErBacillus subtilis complete 
genome (section 1 of 21): from 1 to213080.] [NTrsimilar to hypothetical proteins] 
[SP:P37565] [LEr79877] [RE:80752] [DIrdirect] 



ORF Name 



NTID 



_ _ -p y. NT AA 

AAID — - — , , score 

LengthLength 



probability 



E 



17503000980 21579131 t3 157 



j |158 | [3930 ||T53 | 



Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID — — _ score 
LengthLength 



E 



17503000980 22694002 c3 258 



TF9~ 



|393i |p^~[|i8i |prr 



probability 
1.7e-52 



Description 

sp r [LN r HPRT_BACSU] [ACrP37472] [GN r HPRT : HPT] [OR r BACILLUS SUBTILIS] [ECr2.4.2.8] 
[DEr (HGPRTASE) ] [SPrP37472] [DB r swissprot] >pir r [LN r S66098] [AC r S66098 r E69642 ] 
[PN : hypoxanthine phosphoribosyl transferase , hprT : hypoxanthine -guanine 

phosphoribosyltransf erase hprT] [GNrhprT ] [CL : hypoxanthine phosphoribosyl transferase] 
[OR:Bacillus subtilis] [EC:2.4.2.8] [DB:pir2] >gp r [GI r dl005845 r g467457] [LN:BAC180K] 
[AC r D26185] [PN r hypoxanthine -guanine phosphoribosyltransf erase] [GN r hprt ] [OR r Bacillus 
subtilis] [SRrBacillus subtilis (sub_species : Marburg, strainrl68) DNA] 
[DB :genpept-bctl] [DErB. subtilis DNA, 180 kilobase region of replication origin.] 
[LErl39944] [RErl40486] [DIrdirect] >gp r [GI rell82001rg2 632335] [LN: BSUB0001] 
[AC:Z99104 :AL009126] [PN : hypoxanthine -guanine phosphoribosyltransf erase] [GNrhprT] 
[FNrpurine salvage] [ORrBacillus subtilis] [DB r genpept -bctl] [ECr2.4.2.8] [DErBacillus 
subtilis complete genome (section 1 of 21) r from 1 to213080.] [SPrP37472] [LEr76342] 
[RE r 76884] [DI rdirect] 



NT AA 

ORF Name NTID AAID — L1 , — , score probability 
LengthLength L - 



|AI7503000980_23445130_clJ>07 | | 160 | [ 3932 | pT^[ | 138 | p^"|| 4 . le- 08 

Description 

gpr [GIrell82000rg2632334] [LN r BSUB0001] [AC r Z99104 : AL009126] [GNryacA] [FNrunknown] 
[ORrBacillus subtilis] [DB r genpept -bctl] [DErBacillus subtilis complete genome (section 
1 of 21) r from 1 to213080.] [NTrsimilar to cell-cycle protein] [LEr74927] [REr76345] 
[DI rdirect] 



118 



ORF Name 



NT AA 
NTID AAID — « _ — ^ scor e 
LengthLength 



probability 



IAI 7 503000980 23601702 ci 190 



] |161 | [3933 ||891 ||296 ||919 ||3.1e-gT 



Description 

Sp: [LN : KSGA_BACSU] [AC:P37468] [GN: KSGA] [OR:BACILLUS SUBTILIS] [EC:2.1.1.-] 
[DE : DIMETHYLTRANSFERASE) ] [SP:P37468] [DB : swissprot] >pir: [LN:S66071] [AC : S66071 : A6964 9 
] [PN: dimethyl adenosine transferase ksgA: high level kasgamycin resistance protein ksgA] 
[GN: ksgA ] [CLrrRNA (adenine-N6 -) -methyltransf erase] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl005819:g467431] [LN:BAC180K] [AC:D26185] [PN:high level kasgamycin 
resistance] [GN : ksgA] [OR: Bacillus subtilis] [SR: Bacillus subtilis 
(sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 
kilobase region of replication origin.] [LE:114240] [RE:115118] [DI:direct] 
>gp: [GI:ell81975:g2632309] [LN : BSUB0001] [AC : Z99104 : AL009126] [PN : dimethyladenosine 
transferase] [GN : ksgA] [FN: high level kasugamycin resistance] [OR: Bacillus subtilis] 
[DB: genpept-bctl] [EC:2.1.1.-] [DE:Bacillus subtilis complete genome (section 1 of 21): 
from 1 to213080.] [SP:P37468] [LE:50638] [RE: 51516] [DI: direct] 



ORF Name 



NT AA 

NTID AAID ^,_ T , - score 

LengthLength 



probability 



E 



17503000980 23605438 c2 238 



] | lg2 | |3934 |pg~||47 | 



Description 
NO-HIT 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



IAI7503000980 23631327 c3 250 



J |163 | [3935 |pgg— 1|321 |JI22F 



probability 
l.le-124 



Description 

gp: [GI:g2289093] [LN: CAU76387] [AC:U76387] [PN:PRPP synthetase] [GN:prs] 
[OR : Corynebacterium ammoniagenes] [DB : genpept-bctl] [EC: 2. 7. 6.1] [DE :Corynebacterium 
ammoniagenes N- acetyl glucoseamine 1-phosphateuridyl transferase (glmU) gene, partial 
cds, and PRPP- synthetase (prs ) gene, complete cds . ] [LE:321] [RE:1274] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID . , _ _ v 

LengthLength 



score 



IAI7503000980 23642135 ci 185 



[3935 ||357"]|118 |pT 



probability 
|4.5e-36 



Description 

sp: [LN:YAAQ_BACSU] [AC:P37538] [GN : YAAQ] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 12.0 
KD PROTEIN IN XPAC-ABRB INTERGENIC REGION] [SP:P37538] [DB : swissprot] >pir : [LN : S66059] 
[AC:S66059:G69737 ] [PN : conserved hypothetical protein yaaQ] [GN:yaaQ ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005807 : g467419] [LN: BAC180K] [AC:D26185] [PN:unknown] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB : genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:103471] [RE:103800] [DI:direct] >gp : [GI : ell81962 : g26322 96] [LN : BSUB0001] 
[AC:Z99104:AL009126] [GN:yaaQ] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [NT:similar 
to hypothetical proteins] [SP:P37538] [LE:39869] [RE:40198] [DI: direct] 



ORF Name 



NTID 



NT 

AAID _ 



AA 



LengthLength 



E 



238S0952 ci 206 



][ 



[3337 | p^T 



score probability 
304 ||4.5e-27 



Description 

gp: [GI:g4981097] [LN : AE001733] [AC : AE001733 : AE000512 ] [PN : conserved hypothetical 
protein] [GN:TM0579] [OR : Thermotoga maritima] [DB :genpept-bct2] [DE : Thermotoga maritima 
section 45 of 136 of the complete genome.] [NT:similar to GB:AE000657 percent identity: 
59.32;] [LE:230] [RE:1474] [DI:direct] 



119 



NT AA 

QRF Name ^5 L e^thLe^th ^^ probability 
piI 7 50a0009B0_24il>09bi_tJ_120 | | |393S [p^~~[ p |p 1| 0.017 "| 

Description 

pir: [LN:G71244] [AC:G71244 ] [PN : hypothetical protein PH0217] [GN:PH0217 ] 
[OR : Pyrococcus horikoshii] [DB:pir2] >gp : [GI :dl030229 :g3256603] [LN : AP000001] 
[AC:AP000001 :AB009465 :AB009464 : AB009466 : AB009467 :AB009468 :AB009469] [PN: 106aa long 

hypothetical protein] [GN:PH0217] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain :0T3) DNA] [DB :genpept-bctl] [DE : Pyrococcus horikoshii 0T3 genomic DNA, 1-287000 

nt. position (1/7).] [LE:191072] [RE:191392] [DI : complement] 



NT AA 

QRF Name NT ID AAID — — score probability 



Length Length 

AI750300098Q_2464 7936_c3_2 56 [ |i£7 | |3939 ||408 | [ITS [[172 ||4.4e-13 



Description 

gp: [GI :g4090866] [LN : AF023 181] [AC : AF023181] [PN:DivIC homolog] [GN: divL] [OR:Listeria 
monocytogenes] [DB : genpept -bct2 ] [DE:Listeria monocytogenes transcription-repair 
coupling factor (mf dL) , low temperature requirement B protein (ltrB) , and DivIC 
homolog (divL) genes, complete cds . ] [LE:6077] [RE: 6463] [DI: direct] 

NT AA 

QRF Name NTID AAID Le ~ tnLe ~ th score probability 



AI7B03000980_24645092_c2_236 | | [ 3940 || 378 ~|| 125 | p5I~|| 4 . 7e- J2 "] 



Description 

sp: [LN : FOLB_STAHA] [AC:Q59920] [GN : FOLB : FOLQ] [OR : STAPHYLOCOCCUS HAEMOL YT I CUS ] 
[EC:4.1.2.25] [DE : D I HYDRONEO PTERIN ALDOLASE, (DHNA) (FRAGMENT)] [SP:Q59920] 
[DB:swissprot] >gp : [GI : glll8003] [LN: SHU40768] [AC:U40768] [PN : dihydroneopterin 
aldolase] [GN: f olQ] [OR : Staphylococcus haemolyticus] [DB : genpept-bct2] [EC : 4 . 1 . 2 . 25] 
[DE: Staphylococcus haemolyticus cysteine synthase A (cysK) anddihydroneopterin aldolase 
(folQ) genes, partial cds, anddihydropteroate synthase (folP) gene, complete cds.] 
[NT :DHNA] [LE:1467] [RE:>1724] [DI: direct] 



NT AA 

QRF Name NTID AAID — , — , , score probability 

LengthLength s - 



IAI7503000980 24653892 c3 251 



[3941 [[1881 | [626 |pT2? 



I.Se-124 



Description 

Sp : [LN : MFD_BACSU] [AC:P37474] [GN:MFD] [OR:BACILLUS SUBTILIS] [DE : TRANSCRIPTION- REPAIR 
COUPLING FACTOR (TRCF) ] [SP:P37474] [DB : swissprot] >pir : [LN: S66085] [AC : S66085 : F6 9657 ] 
[PN: transcription- repair coupling factor mfd] [GN:mfd ] [CL : transcription- repair 
coupling protein :DEAD/H box helicase homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl005832:g467444] [LN:BAC180K] [AC:D26185] [PN : transcription- repair coupling 
factor] [GN:mfd] [OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, 
strain: 168) DNA] [DB : genpept -bet 1] [DE :B . subtilis DNA, 180 kilobase region of 
replication origin.] [LE:124030] [RE:127563] [DI:direct] >gp : [GI :ell81988 :g2632322] 
[LN:BSUB0001] [AC : Z99104 : AL009126] [PN : transcription-repair coupling factor] [GN:mfd] 
[FN:probably involved in homologous DNA] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [SP:P37474] 
[LE:60428] [RE:63961] [DI: direct] 



NT AA 

QRF Name NTID AAID — ^ — , , score probability 
LengthLength *- 



A^7503000980_24734661_tl_13 | |170 | [3942 |[TTg"~| | 45 | 



Description 
NO-HIT 



120 



ORF Name 



NT 



AA 



NTI2 MID Le - thLe ^th ^^ probability 



[AI7503000980 24790916 el 208 



] | 171 | | 3943 |P 106 1 1 701 |pi33 |[6.9e-221 



Description 

sp: [LN:FTSH_BACSU] [AC:P37476] [GN : FTSH] [OR : BACILLUS SUBTILIS] [EC : 3 . 4 . 24 . - ] [DE : CELL 
DIVISION PROTEIN FTSH HOMOLOG, ] [SP:P37476] [DB : swissprot] >pir : [LN : E69627] 
[AC:E69627:S66099 ] [PN : cell-division protein / general stress protein ftsHrclass III 
heat shock protein ftsH] [GN : f tsH ] [CLrcell division protein ftsH: 

FtsH/SEC18/CDC48-type ATP-binding domain homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI :dl005846 :g467458] [LN:BAC180K] [AC:D26185] [PN:cell division protein] [GNrftsH] 
[OR:Bacillus subtilis] [SRrBacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB : genpept-bctl] [DE:B. subtilis DNA, 180 kilobase region of replication origin.] 
[LE:140584] [RE:142497] [DI: direct] >gp : [GI : ell820 02 :g2 632336] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [PN : cell-division protein and general stress protein] [GNrftsH] 
[FN: involved in major cellular processes such as] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC : 3 . 4 . 24 . -] [DE:Bacillus subtilis complete genome (section 1 of 
21): from 1 to213080.] [SP:P37476] [LE:76982] [RE:78895] [DI:direct] 



ORF Name 



NT AA 

NT ID AAID Le ~ thLe ~ th score probability 



I&I7563000980 25662965 ci 195 



] | 172 | | 3944 || 725 || 242 |pT""[|g.4e-44 



Description 

sp: [LN:CTC_BACSU] [AC:P14194] [GN:CTC] [OR:BACILLUS SUBTILIS] [DE : GENERAL STRESS 
PROTEIN CTC] [SP:P14194] [DB : swissprot] 



ORF Name 



NT AA 

NT ID AAID — ^ T — , , score probabilit y 
LengthLength c 



IAI7503000980 25210061 12 99 



~] |173 [ [3945 1|47 | 



Description 
NO-HIT 



ORF Name 



NT AA 

NT ID AAID — _ „ — k , score probabi lity 
LengthLength c L - 



AI7503000980 26839660 C3 246 



] | 174 | | 3946 — | pa4 |pra~ 



4.4e-84 



Description 

gp: [GI:el386912:g4454322] [LN : SAU132803] [AC : AJ132803] [PN : hypothetical protein] 
[OR staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus ORF1 and ORF2 
(partial).] [NT : ORF2 ] [LE:1402] [RE:>2052] [DIrdirect] 



ORF Name 



NT AA 

NT ID AAID — _^ — , _ score probabili ty 
LengthLength c L - 



IAI7503000980 2928502 c3 262 



175 | [3947 |p^r~| prr~ 



1166 ||2.1e-ii8 



Description 

sp: [LN:CYSK_BACSU] [AC:P37887] [GN : CYSK] [OR:BACILLUS SUBTILIS] [EC : 4 . 2 . 99 . 8] 
[DE: PROTEIN 11) (SOI11) ] [SP:P37887] [DB : swissprot ] 



ORF Name 



IAI7503000980 29298162 t2 118 



Description 
NO-HIT 



NT AA 

NT ID AAID Le ~ thLe ~ th score probability 



|176 | [3948 |p3~||40 | 



ORF Name 



p7503000980_29307i87_c2_2i3 

Description 

NO-HIT 



NT AA 

NT ID AAID — — , , score probabilit y 
L en gthLength *- 



|177 | [3949 p 



121 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000980 3144&881 c2 215 



] I 178 1 1 3850 IP^ 17 !! 116 IP*" 



probability 
2.1e-15 



Description 

sp: [LN:YABA_BACSU] [AC:P37542] [GN : YABA] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 14.1 
KD PROTEIN IN XPAC-ABRB INTERGENIC REGION] [SP:P37542] [DB : swissprot ] >pir : [LN : S66063 ] 
[AC:S66063:B69738 ] [PN : hypothetical protein yabA] [GN:yabA ] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl005811 :g467423] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus 
subtilis] [SRrBacillus subtilis (sub_species : Marburg, strain:168) DNA] 
[DB :genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:106099] [RE:106458] [DI : direct] >gp: [GI :ell81966 :g2 632300] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [GN:yabA] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [SP:P37542] 
[LE:42497] [RE:42856] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — 1T — . , score 
LengthLength 



probability 



7503000980 32S15811 13 155 



TTT 



3951 IH55" 



62 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



IA17503000980 33225017 c2 222 



][ 



TS7T 



[3952 |[gg7~||2S8 ||75? 



score probability 
8 . 8e-79 " 



Description 

sp: [LN:PURR_BACSU] [AC:P37551] [GN: PURR] [OR:BACILLUS SUBTILIS] [DE : PUR OPERON 
REPRESSOR] [SP:P37551] [DB : swissprot] >pir : [LN : S66076] [AC : S66076 : D69685 ] 
[PN: transcription repressor of purine operon purR] [GN :purR ] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl005824 : g467436] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] 
[DB: genpept-bctl] [DE:B. subtilis DNA, 180 kilobase region of replication origin.] 
[LE:118041] [RE:118898] [DI:direct] >gp : [GI : ell81980 :g2632314] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [PN : transcriptional regulator] [GN:purR] [FN:negative regulation 
of the purine operon] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 1 of 21): from 1 to213080.] [NT : alternate gene name: yabl] 
[SP:P37551] [LE:54439] [RE:55296] [DI:direct] 



ORF Name 



NTID 



NT 

AAID 



AA 



LengthLength 



score 



IAI7503000980 33313817 cl 192 



] |181 | [3953 |pTS~]|110 |pr 



probability 
4.4e-29 



Description 

sp: [LN:SP5G_BACME] [AC:P28016] [GN:SPOVG] [OR:BACILLUS MEGATERIUM] [DE : STAGE V 
SPORULATION PROTEIN G] [SP:P28016] [DB : swissprot] >pir: [LN:S18900] [AC:S18900 ] 
[PN:spoVG protein] [CL: stage V sporulation protein spoVG] [OR: Bacillus megaterium] 
[DB:pir2] >gp: [GI:g39656] [LN:BMSPOVG] [AC:X62377] [GN:spoVG] [OR: Bacillus megaterium] 
[DB: genpept-bctl] [DE : B . megaterium spoVG and tms genes.] [SP:P28016] [LE:31] [RE:321] 
[DI: direct] 



122 



NT AA 

QRF Name ™ Le^thLe^th^^ probability 



|AI 7 B03000980_J4001S10_c2J*lB | | 182 | p9E>4 |[7^~| | 265 | pi& || 6.4e-92 

Description 

sp: [LN:YABD_BACSU] [AC:P37545] [GN : YABD] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 29.2 
KD PROTEIN IN METS-KSGA INTERGENIC REGION] [SP:P37545] [DB : swissprot] >pir : [LN : S66068] 
[AC:S66068 :E69738 ] [PN : conserved hypothetical protein yabD] [GNryabD ] 
[CL: hypothetical protein HI0454] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl005816:g467428] [LN : BAC180K] [AC:D26185] [PN:unknown] [ORrBacillus subtilis] 
[SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB :genpept-bctl] [DE : B . 

subtilis DNA, 180 kilobase region of replication origin.] [LE : 111306] [RE: 112073] 
[DI: direct] >gp: [GI :ell81972 :g2632306] [LN : BSUB0001] [AC : Z99104 : AL009126] [GNryabD] 
[FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] [DEiBacillus subtilis complete 

genome (section 1 of 21): from 1 to213080.] [NT:similar to hypothetical proteins] 
[SP:P37545] [LE:47704] [RE:48471] [DI:direct] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



AI7S03000960_34428S16_clJ.9i | |153 | [ 3955 | pB~| | 94 | p3~ | |fi . Oe-25 



Description 

sp: [LN:VEG_BACSU] [AC:P37466] [GN: VEG] [OR:BACILLUS SUBTILIS] [DE : VEG PROTEIN] 
[SP:P37466] [DB : swissprot] >pir : [LN : S66073 ] [AC : S66073 : C69730 ] [PN : hypothetical 
protein veg] [GN:veg ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : d!005821 :g467433] 
[LN:BAC180K] [AC:D26185] [PN:unknown] [GN:veg] [OR:Bacillus subtilis] [SR:Bacillus 
subtilis (sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 
180 kilobase region of replication origin.] [LE: 116363] [RE: 116623] [DI:direct] 
>gp: [GI:ell81977:g2632311] [LN : BSUB0001] [AC : Z99104 : AL009126] [GN:veg] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
1 Of 21): from 1 to213080.] [SP:P37466] [LE:52761] [RE:53021] [DI:direct] 



NT AA 

QRF Name NTID AAID — , — , score probability 
Length Length c 



A17b03000880_345710ii_ca_2!>2 | |184 | [ 3956 | |160 r| | 535 || 1712 | 



2.8e-176 



Description 

Sp : [LN : MFD_BACSU] [AC:P37474] [GN : MFD] [OR:BACILLUS SUBTILIS] [DE : TRANSCRIPT I ON -RE PAIR 
COUPLING FACTOR (TRCF) ] [SP:P37474] [DB : swissprot ] >pir : [LN : S66085] [AC : S66085 : F69657 ] 
[PN: transcription-repair coupling factor mfd] [GN:mfd ] [CL : transcription-repair 
coupling protein :DEAD/H box helicase homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI :dl005832 :g467444] [LN:BAC180K] [AC:D26185] [PN : transcription-repair coupling 
factor] [GN:mfd] [OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, 
strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of 
replication origin.] [LE:124030] [RE:127563] [DI:direct] >gp : [GI : ell81988 :g2632322] 
[LN:BSUB0001] [AC : Z99104 : AL009126] [PN : transcript ion- repair coupling factor] [GN:mfd] 
[FN:probably involved in homologous DNA] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [SP:P37474] 
[LE:60428] [RE:63961] [DI:direct] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



1AI7S03000980 35250887 cl 193 1 



Description 
NO-HIT 



] |18S | [3957 | 



1141 I 



155" 



123 



ORF Name 



NT ID 



NT AA 
AAID — -, — , , score 
LengthLength 



probability 



[Ai7b03000980 3bb3 c3 24b 



■j |186 | [3958 ||972 1 1323 ||382 ||2,5e-35 



Description 

sp: [LN:HOLB_BACSU] [AC:P37540] [GN : HOLB] [OR : BACILLUS SUBTI LIS] [EC:2.7.7.7] [DE:DNA 
POLYMERASE III, DELTA' SUBUNIT,] [SP:P37540] [DB : swissprot] >pir : [LN : S66061] 
[AC:S66061:C69642 ] [ PN : DNA polymerase III (delta* subunit) holB:dnaH homolog holB] 
[GN: holB ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl005809 :g467421] [LN:BAC180K] 
[AC:D26185] [PN:similar to B. subtilis DnaH] [OR:Bacillus subtilis] [SR:Bacillus 
subtilis (sub_species : Marburg, strain:168) DNA] [DB :genpept-bctl] [DE:B. subtilis DNA, 
180 kilobase region of replication origin.] [LE: 104265] [RE: 105254] [DI: direct] 
>gp: [GI:ell81964:g2632298] [LN : BSUB0001] [AC : Z99104 : AL009126] [PN: DNA polymerase III 
(delta' subunit)] [GN : holB] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:2.7.7.7] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] 
[NT:alternate gene name: yaaS] [SP:P37540] [LE:40663] [RE:41652] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID _ Ut _ v 

LengthLength 



score 



^17503000980 3915087 c2 215 



it 



[3959 |pgT — [ |86 |pi~| 



probability 
1.3e-20 



Description 

pir: [LN:A69742] [AC:A69742 ] [PN : conserved hypothetical protein yazA] [GN :yazA ] 
[CL: hypothetical protein 312] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI :ell81968 :g2632302] [LN : BSUB0001] [AC : Z99104 : AL009126] [GN:yazA] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
1 of 21): from 1 to213080.] [NT:similar to hypothetical proteins] [LE:43645] [RE:43944] 
[DI : direct] 



ORF Name 



E 



±7b03000980 39379b0 r2 71 



Description 
NO-HIT 



NTID 



][ 



TF5~ 



NT AA 
AAID — — score 
LengthLength 



probability 



[3960 p~ 



ORF Name 



IAI7S03000980 3938838 cl 210 



NTID 



AAID 



NT 



AA 



LengthLength 



— , score probability 



] I 189 1 I 3561 |p8i |prer| 



|4.8e-110 



Description 

sp: [LN:DHPS_STAHA] [AC:Q59919] [GN: FOLP] [OR : STAPHYLOCOCCUS HAEMOLYT I CUS ] [EC : 2 . 5 . 1 . 15] 
[DE : PYROPHOSPHORYLASE) (DHPS) ] [SP:Q59919] [DB : swissprot] >gp : [GI : glll8002 ] 
[LN:SHU40768] [AC:U40768] [PN : dihydropteroate synthase] [GNrfolP] [OR : Staphylococcus 
haemolyticus] [DB :genpept-bct2] [EC : 2 . 5 . 1 . 15] [DE : Staphylococcus haemolyticus cysteine 
synthase A (cysK) anddihydroneopterin aldolase (folQ) genes, partial cds, 
anddihydropteroate synthase (folP) gene, complete cds.] [NT : DHPS] [LE:692] [RE: 1495] 
[DI: direct] 



ORF Name 



NT 

NTID AAID . — 



AA 



LengthLength 



score 



probability 



|AI7b03000980 4034707 t2 114 



Description 
NO-HIT 



] | | 3962 IP^Hl 47 1 



124 



NT AA 

QRF Name S™ L e^thLe^th ^^ probability 



|A17S030009BO_4V2J192_clJL9fc | | 191 | [ 3963 || 606 | |201 | p5~ | [B . 7e- b2 

Description 

sp: [LN:SP5C_BACSU] [AC:P37470] [GN: SPOVC : PTH] [OR : BACILLUS SUBTILIS] [EC : 3 . 1 . 1 . 29] 
[DE: SPORULATION PROTEIN C) ] [SP:P37470] [DB : swissprot ] >pir : [LN : C69715] 

[AC:C69715:S66083 ] [PN: stage V sporulation protein spoVC: spore coat formation protein 
spoVC] [GN:spoVC ] [CL:peptidyl-tRNA hydrolase] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl005830 :g467442] [LN:BAC180K] [AC:D26185] [PN:stage V sporulation] [GN:spoVC] 
[OR: Bacillus subtilis] [SR: Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB:genpept-bctl] [DE:B. subtilis DNA, 180 kilobase region of replication origin.] 
[LE:123104] [RE:123670] [DI:direct] >gp: [GI : ell81986 : g2632320] [LN: BSUB0001] 
[AC:Z99104 :AL009126] [GN:spoVC] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 1 of 21): from 1 to213080.] [NT : thermosensitive 
mutant blocks spore coat formation] [SP:P37470] [LE:59502] [RE:60068] [DI:direct] 

NT AA 

QRF Name NT ID AAID — « — , , score probability 
LengthLength ^ -L 



|&I7503000980_4775312_c3_257 | |i52 | [3964 |pr~j |i34 j | 435 || 4.7e-41 

Description 

Sp : [LN : YABR_BACSU] [AC:P37560] [GN: YABR] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 14.2 
KD PROTEIN IN DIVIC-SPOIIE INTERGENIC REGION] [SP:P37560] [DB : swissprot] 
>pir: [LN:C53380] [AC : C53380 : S66093 : A69740 ] [PN : polyribonucleotide 

nucleotidyltransferase homolog yabR:divIC 3' -region hypothetical protein] [GN:yabR ] 
[CL: polyribonucleotide nucleotidyltransferase homolog yabR] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI :dl 005840 :g4 67452] [LN:BAC180K] [AC:D26185] [PN:unknovm] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB: genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:133226] [RE:133612] [DI:direct] >gp : [GI :g385178] [LN: BACDIVIC] [AC:L23497] 
[PN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis cell 
division protein (divIC) gene, complete cds, transfer RNA genes, complete sequence and 
sporulation protein (spoIIE) gene, 5' end of cds.] [NT:orfl28; homologous to RNA binding 
domain of E. coli] [LE:914] [RE:1300] [DI:direct] >gp : [GI : ell81996 :g2632330] 
[LN:BSUB0001] [AC : Z99104 : AL009126] [GN:yabR] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 1 of 21) : from 1 
to213080.] [NT: similar to polyribonucleotide] [SP:P37560] [LE : 69624] [RE: 70010] 
[DI : direct] 



NT AA 

QRF Name NTID AAID — . — . , score probability 
LengthLength c £ - 



A:£75030009g0_4684625_cl_lB9 | [193 | [3955 | 182 | | 445 || S.2e-42 



Description 

Sp : [LN : YABF_BACSU] [AC:P37547] [GN : YABF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 20.7 
KD PROTEIN IN METS-KSGA INTERGENIC REGION] [SP:P37547] [DB : swissprot] >pir : [LN : S66070] 
[AC:S66070:G69738 ] [PN : conserved hypothetical protein yabF] [GN:yabF ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005818 :g467430] [LN:BAC180K] [AC:D26185] [PN:unknown] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB: genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE: 113687] [RE: 114247] [DI:direct] >gp : [GI : el 181974 :g2 632308] [LN :BSUB0001] 
[AC:Z99104:AL009126] [GN:yabF] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [NT:similar 
to hypothetical proteins] [SP:P37547] [LE:50085] [RE:50645] [DI:direct] 

NT AA 

QRF Name NTID AAID — , — , , score probabi lity 

LengthLength c 



A17S03000980_5317161_c3_264 | |194 | |3965 ||177 | p~ 



Description 
NO-HIT 



125 



NT AA 

ORFName NTID AAJP Le ~ thLe ~ th gco£g probability 



A!T7503000980_534401b_c3_2b5 | | [ 3957 | p5T] | 4 2 6 | |gyg~[| U . be-Ub — | 



Description 

sp: [LN:YABN_BACSU] [AC:P37556] [GN : YABN] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 56.1 
KD PROTEIN IN MFD-DIVIC INTERGENIC REGION] [SP:P37556] [DB : swissprot] >pir : [LN : S66088] 
[AC:S66088 :E69739 ] [PN : conserved hypothetical protein yabN : beta- lactamase regulatory 
protein homolog yabN] [GN:yabN ] [CL : beta- lactamase regulatory protein homolog: 
beta- lactamase regulatory protein homology] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:dl005835:g467447] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] 
[SRrBacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB :genpept-bctl] [DE : B . 
subtilis DNA, 180 kilobase region of replication origin.] [LE: 130005] [RE: 131474] 
[DIrdirect] >gp: [GI :ell81991 :g2632325] [LN : BSUB0001] [AC : Z99104 : AL009126] [GNiyabN] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 1 of 21): from 1 to213080.] [NT:similar to hypothetical proteins] 
[SP:P37556] [LE:66403] [RE:67872] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — score probability 
LengthLength c 



AI7503000980_5367Si3_c3_244 | [195 | [ 3968 [180 | p? 



i.2e-40 



Description 

sp: [LN:ATDA_ECOLI] [AC:P37354] [GN: SPEG] [OR : ESCHERICHIA COLI] [EC : 2 . 3 . 1 . 57] 
[DE : ACETYLTRANSFERASE) (SAT)] [SP:P37354] [DB : swissprot ] >gp : [GI : dl016007 :gl742583] 
[LN:D90799] [AC : D90799 : AB001340] [PN : Spermidine Nl-acetyltransf erase (EC 2.3.1.57)] 
[GN : speG] [OR Escherichia coli] [SR : Escherichia coli (strain:K12) DNA, clone_lib : Kohara 
lambda minise] [DB :genpept-bctl] [DE:E.coli genomic DNA, Kohara clone #308(35.3-35.7 
min.).] [NT:0RF_ID:o3 09#16; similar to [SwissProt Accession] [LE:<18154] [RE:18711] 
[DI:direct] >gp: [GI :dl016029 :gl742606] [LN:D90800] [AC : D90800 : ABO 01 340] [PN : Spermidine 
Nl-acetyltransf erase (EC 2.3.1.57)] [GN: speG] [OR : Escherichia coli] [SR : Escherichia 
coli (strain:K12) DNA, clone_lib : Kohara lambda minise] [DB :genpept-bctl] [DE:E.coli 
genomic DNA, Kohara clone #309(35.4-35.7 min.).] [NT : ORF_ID : o309#16 ; similar to 
[SwissProt Accession] [LE:<14930] [RE:15487] [DI:direct] >gp : [GI : dl016031 :gl742609] 
[LN:D90801] [AC :D90801 : ABO 01 340] [PN : Spermidine Nl -acetyltransf erase (EC 2.3.1.57)] 
[GN : speG] [OR : Escherichia coli] [SR : Escherichia coli (strain:K12) DNA, clone_lib : Kohara 
lambda minise] [DB :genpept-bctl] [DE:E.coli genomic DNA, Kohara clone #310(35.7-36.0 
min.).] [NT:ORF_ID:o309#16; similar to [SwissProt Accession] [LE:<814] [RE:1371] 
[DI:direct] 

NT AA 

ORF Name NTID AAID — _ T — . , score probability 

LengthLength c £ - 



AI7503000980_581250_t3_137 | |197 | [ 3959 || 183 1 160 | 



Description 
NO-HIT 



126 



NT AA 

ORF Name NTID AAID Le — thLe — th score probability 



AI7B030009B0J>01bB42_cJ_249 | [ 198 | [ 3970 ||900 ~]| 299 | |750~| p ■ be-74 



Description 

sp: [LN:YABH_BACSU] [AC:P37550] [GN:YABH] [ORrBACILLUS SUBTILIS] [DE : HYPOTHETICAL 31.7 
KD PROTEIN IN SSPF-PURR INTERGENIC REGION (ORF1) ] [SP:P37550] [DB : swissprot] 
>pir: [LN:S66075] [AC : S66075 : A69739 ] [PN : conserved hypothetical protein yabH] [GNryabH 
] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : dl005823 :g467435] [LN : BAC180K] [AC:D26185] 
[PNrunknown] [ORrBacillus subtilis] [SRrBacillus subtilis (sub_species r Marburg, 
strain: 168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of 
replication origin.] [LE:117116] [RE:117985] [DIrdirect] >gp : [GI : ell81979 :g2632313] 
[LN : BSUB0001] [AC : Z99104 : AL009126] [GNryabH] [FNrunknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 1 of 21) : from 1 
to213080.] [NT: similar to hypothetical proteins] [SP:P37550] [LE: 53514] [RE: 54383] 
[DIrdirect] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|A:HB03000980_6136S62_c2_223 | |199 | [ 3971 |p^~| | 133 | p^~| |4 . le-33 "| 

Description 

Sp : [LN : YABJ_BACSU] [AC:P37552] [GN: YABJ] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 13.7 
KD PROTEIN IN PURR-SPOVG INTERGENIC REGION (ORF2)] [SP:P37552] [DB : swissprot] 
>pir: [LN:S66077] [AC : S66077 : B69739 ] [PN : conserved hypothetical protein yabJ] [GN:yabJ 
] [CL: hypothetical protein HI0719] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl005825:g467437] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] 
[SRrBacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB :genpept-bctl] [DE : B . 
subtilis DNA, 180 kilobase region of replication origin.] [LE:118895] [RE:119272] 
[DI:direct] >gp: [GI :ell81981:g2 632315] [LN : BSUB0001] [AC : Z99104 : AL009126] [GN:yabJ] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 1 of 21): from 1 to213080.] [NT:similar to hypothetical proteins] 
[SP:P37552] [LE:55293] [RE:55670] [DIrdirect] 



NT AA 

ORF Name NTID AAID — ^ T — , , score probability 
LengthLength ^ 

IAI7503000980 5742943 c2 217 1 (TuTT 



[3972 ||1983-[|660 | p5T]|9 . 8e-2 JW 



Description 

sp: [LN:SYM_BACSU] [AC:P37465] [GN:METS] [OR : BACILLUS SUBTILIS] [EC : 6 . 1 . 1 . 10] 
[DE : (METRS) ] [SP:P37465] [DB : swissprot ] >pir: [LN:S66067] [AC : S66 067 : E69657 ] 
[PN: methionine --tRNA ligase, metS :methionyl-tRNA synthetase metS] [GN:metS ] 
[CL: methionine --tRNA ligase] [ORrBacillus subtilis] [EC : 6 . 1 . 1 . 10] [DB:pir2] 
>gp: [GIrdl005815:g467427] [LNr BAC180K] [ACrD26185] [PN r met hionyl -tRNA synthetase] 
[GNrmetS] [ORrBacillus subtilis] [SRrBacillus subtilis (sub_species : Marburg, 
strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of 
replication origin.] [LE:109233] [RE:111227] [DI:direct] >gp : [GI : ell81971 r g2632305] 
[LN r BSUB0001] [AC r Z99104 r AL009126] [PN : methionyl - tRNA synthetase] [GNrmetS] 
[ORrBacillus subtilis] [DB r genpept-bctl] [EC r 6 . 1 . 1 . 10] [DErBacillus subtilis complete 
genome (section 1 of 21) r from 1 to213080.] [SPrP37465] [LEr45631] [REr47625] 
[DI r direct] 



127 



NT AA 

QRF Name NTID AM5 Le ^ thLe - th score probability 



AI7503000980_6834427_c2_2:>7 | | 201 | [ 39 7 3 | | lbQ ^| |49 | p 1| 0. 00010 



Description 

sp: [LN:YABO_BACSU] [AC:P37557] [GN : YABO] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 9.7 KD 
PROTEIN IN MFD-DIVIC INTERGENIC REGION] [SP:P37557] [DB : swissprot] >pir : [LN : S6608 9] 
[AC:S66089 :F69739 ] [PN : conserved hypothetical protein yabO] [GN:yabO ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005836 :g467448] [LN:BAC180K] [AC:D26185] [PN:unknown] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB: genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:131477] [RE:131737] [DI:direct] >gp : [GI :ell81992 :g2 632326] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [GN:yabO] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DErBacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [NT:similar 
to hypothetical proteins] [SP:P37557] [LE:67875] [RE:68135] [DI:direct] 



NT AA 

QRF Name NT ID AAID — . — . , score probability 
Length Length *- 

LAI7503000980 7036526 ci 212 



[3974 ||IT3— |p ||BS [[0.00057 



Description 

pir: [LN:C64571] [AC:C64571 ] [PN : hypothetical protein HP0411] [OR : Helicobacter pylori] 
[DB:pir2] >gp: [GI :g2313526] [LN : AE000557] [AC : AE000557 : AE000511] [PN:H. pylori 

predicted coding region HP0411] [GN:HP0411] [OR : Helicobacter pylori 26695] 
[DB:genpept-bct2] [DE Helicobacter pylori 26695 section 35 of 134 of the complete 

genome.] [NT hypothetical protein; identified by GeneMark;] [LE:3068] [RE:3385] 
[DI : direct] 

NT AA 

QRF Name NT ID AAID Le ~ thLe ~ th score probability 



AI7503 000980_761_cl_187 


| |203 | |3975 ||750 | |249 ||919 ||3.ie-92 | 


Description 




gp: [GI:el386911:g4454321] 
[OR staphylococcus aureus] 
(partial).] [NT : ORF1] [LE: 


[LN:SAU132803] [AC : AJ132 803] [PN : hypothetical protein] 

[DB:genpept-bctl] [DE : Staphylococcus aureus ORF1 and ORF2 
434] [RE: 1159] [DI: direct] 


ORF Name 


NT AA 

NT ID AAID , — . , , — . , score probability 
Leny Lh Leng Lh c *- 


A17S03000980_829800_cl_20I 


| |204 | |3976 ||1575||b24 ||700 ||4.9e-69 



Description 

gp: [GI:g4090864] [LN : AF023181] [AC : AF023 181] [PN:low temperature requirement B protein] 
[GN: ltrB] [OR:Listeria monocytogenes] [DB : genpept-bct2] [DErListeria monocytogenes 
transcription-repair coupling factor (mf dL) , low temperature requirement B protein 
(ltrB) , and DivIC homolog (divL) genes, complete cds . ] [LE:3972] [RE:5546] [DI:direct] 

NT AA 

ORF Name NT ID AAID — , — , score probability 
LengthLength *- 



AI7503000980_97g538_cl_183 | [205 | [3977 |pT7T] [45g | [F^~| 



9.0e-54 



Description 

Sp : [ LN : YAAO_BACSU] [AC:P37536] [GN: YAAO] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 53.2 
KD PROTEIN IN XPAC-ABRB INTERGENIC REGION] [SP:P37536] [DB : swissprot ] >pir: [LN:S66057] 
[AC:S66057 :F69737 ] [PN:lysine decarboxylase homolog yaaO] [GN:yaaO ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005805 :g467417] [LN:BAC180K] [AC:D26185] [PN:similar to 
lysine decarboxylase] [OR: Bacillus subtilis] [SR: Bacillus subtilis 
(sub_species : Marburg, strain: 168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 
kilobase region of replication origin.] [LE:101320] [RE:102762] [DI:direct] 
>gp: [GI:ell81960:g2632294] [LN : BSUB0001] [AC : Z99104 : AL009126] [GN:yaaO] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
1 of 21): from 1 to213080.] [NT:similar to lysine decarboxylase] [SP:P37536] [LE:37718] 
[RE : 39160] [DI : direct] 



128 



ORF Name 


NT AA 

NTID AAID L — thL — th score probability 




12 06 Ijy /o ±Zo 1 p- 1 - 


Description 
NO-HIT 


ORF Name 


NT AA 

NTID AAID L — — th score probability 


AI750300098iJ.0392555J:3_I7 


207 13979 11180 1159 1 
1 II II 1 


Description 
NO-HIT 




ORF Name 


NT AA 

NTID AAID . — . , , — , , score probability 
Length Length c 


A17503000981_134392_r3_23 


| |208 | |3980 153 1 150 | 


Description 
NO-HIT 


ORF Name 


NT AA 

NTID AAID . — , — , score probability 
Leny t h Leng t h c *- 


AI7503000981_1385927_t3_22 


| |209 | |3981 |369 ||122 ||85 0.039 


Description 

gp: [GI:el332543:g3763999] [LN: PFMAL3P4] [AC : AL008970] [GN:MAL3P4 . 1] [OR : Plasmodium 
falciparum] [SR:malaria parasite P. falciparum] [DB : genpept - invl ] [DE : Plasmodium 
falciparum MAL3P4, complete sequence.] [NT : predicted using hexExon; MAL3P4.1 
(PFC0570C) , ] [LE:106887:107747] [RE : 107641: 108206] [DI : complement Join] 


ORF Name 


NT AA 

NTID AAID L — — th score probability 


|AI7503000981_i4882i35_t2_il 


| |210 | |3982 ||2112||703 ||167 |i.6e-09 | 



Description 

sp: [LN:MELR_ECOLI] [AC:P10411] [GN :MELR] [OR : ESCHERICHIA COLI] [DE :MELIBIOSE OPERON 
REGULATORY PROTEIN] [SP:P10411] [DB : swissprot ] >pir: [LN:RGECMB] 
[AC:A29625 :S56347 :E65221 ] [PN : melibiose operon regulatory protein] [GN: melR ] 
[CL:arabinose operon regulatory protein] [OR: Escherichia coli] [DB:pirl] [MP: 93 min ] 
>gp: [GI:g536963] [LN:ECOUW93] [AC:U14003] [GN :melR] [FN: regulatory gene] 
[OR Escherichia coli] [DB : genpept -bet 1] [DE : Escherichia coli K-12 chromosomal region 
from 92.8 to 00.1 minutes.] [NT : CG Site No. 18166] [LE:31548] [RE:32456] 
[DI: complement] >gp : [GI :gl790559] [LN: AE000484] [AC : AE000484 :U00096] [PN : regulator of 
melibiose operon] [GN:melR] [FN : regulator ; Degradation of small molecules:] 
[OR : Escherichia coli] [DB : genpept -bet 2] [DE : Escherichia coli K-12 MG1655 section 374 of 
400 of the completegenome.] [NT:f302; CG Site No. 18166] [LE:5093] [RE: 6001] 
[DI : complement] 



ORF Name 



_ ___ _ NT AA 

NTID AAID — , — , score 

LengthLength 



probability 



IA17^03000981 23634578 t3 18 



] I 211 I P" 3 IFHP 1 



Description 
NO-HIT 



ORF Name 



E 



17503000981 23850887 £2 13 



Description 
NO-HIT 



NTID AAID — 1 _ T _ T — , , score 
LengthLength 

|212 | [3984 ||46 | 



probability 



129 



ORF Name 



NTID 



NT AA 
AAID — — , , score 
Length Length 



probability 



^1 7 503000981 25634627 ±2 12 



j |213 | [3985 ||45B | |155 ||290 ||1.4e-^~ 



Description 

sp: [LN:GS26_BACSU] [AC : P80238 : P96580] [GN: YDAG] [OR:BACILLUS SUBTILIS] [DE : GENERAL 
STRESS PROTEIN 26 (GSP26 ) ] [SP : P80238 : P96580] [DB : swissprot] >pir : [LN:G69768] 
[AC:G69768 ] [PNrgeneral stress protein homolog ydaG] [GN:ydaG ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl020013 : gl881233] [LN : AB001488] [AC : AB001488] [GN:ydaG] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain:168) DNA] [DB :genpept-bctl] [DE:Bacillus 
subtilis genome sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] 
[NT : FUNCTION UNKNOWN.] [LE:6862] [RE:7284] [DI:direct] >gp : [GI : ell823 88 : g2632722 ] 
[LN:BSUB0003] [AC : Z99106 : AL009126] [GN:ydaG] [FNrunknown] [OR:Bacillus subtilis] 
[DB : genpept - be t 1 ] [DE:Bacillus subtilis complete genome (section 3 of 21) : from 402751 
to611850.] [NT: alternate gene name: yzzA; similar to general] [SP:P80238] [LE : 70606] 
[RE:71028] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — _ ,, — , . score 
LengthLength 



probability 



E 



7503000981 2584538 Cl 29 



] |214 | [3986 |p^-[|43 | 



Description 
NO-HIT 



ORF Name 



NT 



AA 



F 



7503000981 29352342 c2 36 



NTID AAID — — , , score probabilit y 
LengthLength c 

2 |515 | [ 3987 | |r255-| |43i | [ 1218 | |5T3 e- 124 



Description 

sp : [ LN : GLTT_B AC S T ] [AC.-P24943] [GN : GLTT] [OR:BACILLUS STEAROTHERMOPHILUS] [DE : PROTEIN) ] 
[SP:P24943] [DB : swissprot ] >pir : [LN : S2624 7] [AC:S26247 ] [PN : glutamate/aspartate 
transport protein : sodium/proton-glutamate symport protein] [GNrgltP ] [OR: Bacillus 
stearothermophilus] [DB:pir2] >gp : [GI : gl43000] [LN : BACGLTPA] [AC :M86508] [PN:proton 
glutamate symport protein] [GN:gltP] [OR: Bacillus stearothermophilus] [SR: Bacillus 
stearothermophilus (library: ATCC 7954) DNA] [DB :genpept-bctl] [DE:Bacillus 
stearothermophilus proton glutamate symport protein (gltP)gene, complete cds . ] [LE:110] 
[RE : 1375] [DI : direct] 



ORF Name 



^17503000381 30081465 c2 37 



NTID 



AAID 



NT 



AA 



Description 
NO-HIT 



LengthLength 
[3988 | [TT7 | 



score probability 



ORF Name 



IAI7503000981 32242200 tl 2 



Description 
NO-HIT 



ORF Name 



AI7503000981_3396042_cl_34 

Description 

NO-HIT 



NT AA 

NTID AAID — - — , , score probab ility 

LengthLength 



] p 17 | | 3989 | | 378 | |125 | 



NT AA 

NTID AAID — T — , - score probab ility 
LengthLength *- 



|218 | [3990 ||315— 1|104 | 



130 



ORF Name 



IAI7503000981 34179828 c2 35 



Description 
NO-HIT 



NT AA 
NT ID AAID — _ — , , score 
LengthLength 



probability 



"] |219 | [3991 ||129""]|42 | 



ORF Name 



IAI7503000981 410033S ±2 10 



Description 
NO-HIT 

ORF Name 



LAI7503 



000981 6511652 c3 51 



Description 
NO-HIT 



ORF Name 



IAI7503000981 822786 ci 33 



Description 
NO-HIT 



ORF Name 



LL7503000982 10031712 ci 1057 



NT ID 
] |220 



_ _ _ _ NT AA 

AAID — — score 

LengthLength 



probability 



13992 



1153 I 



SIT 



AA 



NT 

NTID AAID — _ — , score 
LengthLength - 



probability 



] |221 | [3993 |[TTg | |¥5~ 



NT AA 
NTID AAID — _ T — . - score 
LengthLength 



probability 



TIT 



|3994 |p^~| 



NTID 



AAID 



NT 



AA 



Length Length 



score 



probability 



~| |223 | [3995 |[^r"| |329 |[597 ||4.ie-58 



Description 

sp: [LN:YQJA_BACSU] [AC:P54538] [GN: YQJA] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 37.1 
KD PROTEIN IN BMRU-ANSR INTERGENIC REGION] [SP:P54538] [DB : swissprot] >pir: [LN:A69963] 
[AC:A69963 ] [PN : hypothetical protein yqjA] [GNryqjA ] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:dl013287:gl303952] [LN : BACJH642] [AC : D84432 : D823 70] [PN: YqjA] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain: JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE:229147] 
[RE:230115] [DI: direct] >gp : [GI : e!185663 : g263482 9] [LN : BSUB0013] [AC : Z99116 : AL009126] 
[GN:yqjA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 13 of 21): from 2395261to 2613730.] [SP:P54538] [LE:93504] 
[RE: 944 72] [DI : complement ] 
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ORF Name 



U.7503000982 c3 1455 



NT AA 
NT ID AAID — , » _ — score 
LengthLength 



probability 



|224 | [3996 ||1236||411 ||996 p.ie-lW 



] 



Description 

sp: [LN:RS1H_BACSU] [AC:P38494] [GN : YPFD : JOFD] [OR:BACILLUS SUBTILIS] [DE:30S RIBOSOMAL 
PROTEIN SI HOMOLOG] [SP:P38494] [DB : swissprot] >pir : [LN : B6993 5] [AC:B69935 ] 
[PN: ribosomal protein SI homolog homolog ypfD] [GN :ypf D ] [CL : Synechocystis ribosomal 
protein SI] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g533106] [LN : BSU11687] [AC:U11687] 
[PN:unknown] [GNrjofD] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
168 jofA, jofB, MssA homolog (jofC) and ribosomalprotein SI homolog (jofD) genes, 
complete cds, and joeB gene, partial cds . ] [NT: similar to the Escherichia coli SI 
ribosomal] [LE:2651] [RE:3799] [DI: direct] >gp : [GI : ell83733 : g2634706] [LN : BSUB0012] 
[AC:Z99115:AL009126] [GN : ypfD] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 12 of 21): from 2195541to 2409220.] 
[NT:alternate gene name: jofD; similar to ribosomal] [SP:P38494] [LE:198361] 
[RE:199509] [DI : complement] >gp : [GI :gll46215] [LN:BACSERA] [AC:L47648] [GN: ypf D] 
[OR:Bacillus subtilis] [DB : genpept -be t2 ] [DE:Bacillus subtilis phosphoglycerate 
dehydrogenase (serA) , ypaA, f erredoxin (fer) , ypbB, recS, ypbD, ypbE, ypbF, ypbG, 
ypbH, glutamate dehydrogenase (ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , 
ypeB, ypf A, ypfB, cytidine monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphc, 
NAD+ dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes , complete 
cds.] [NT: similar to Escherichia coli SI ribosomal protein;] [LE: 16888] [RE: 18036] 
[DI: direct] 



ORF Name 



^17503000982 10334752 tl 178 



Description 
NO-HIT 

ORF Name 



^17503000982 10588877 t3 947 



Description 
NO-HIT 

ORF Name 



IA17503000982 10635918 c2 1290 



Description 
NO-HIT 



NT ID AAID 



NT AA 



LengthLength probability 



[3997 j|T52 [ |53 | 



NT AA 

NT ID AAID — _ „. — , , score probab ility 
LengthLength 



] |225 | [3998 ||162~||53 | 



NT AA 

NTID AAI D — Lv — , , sc ore probability 
LengthLength 



■j |227 | [3999 ||IZ3~||40 | 



ORF Name 



AI7503000982_i0S42i80_c3_15i3 

Description 

NO-HIT 



NT AA 

NTID AAID — _ — , , score probabi lity 
LengthLength c 



228 | [4000 |p^~| pT 



132 



NT AA 

0RF Name EHR Le ^thLe^th score P robabilit y 



AI7503000982_1066(S>06U_c2_1228 I [229 | [4001 ||330 | [109 [|196 ||1.3e-rS" 



Description 

sp: [LN:CMG3_BACSU] [AC:P25955] [GN : COMGC : COMG3 ] [OR : BACILLUS SUBTILIS] [DE : COMG OPERON 
PROTEIN 3 PRECURSOR] [SP:P25955] [DB : swissprot] >pir : [LN : D30338] 

[AC:D30338 :A35133 :D69603 ] [PN : exogenous DNA-binding protein comGCrcomG operon protein 
3] [GN:comGC ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl42708] [LN : B AC COMG A] 
[AC:M29691:M22854] [GN:comG3] [OR:Bacillus subtilis] [SRrBacillus subtilis (strain 168) 
(clone: pED4) DNA] [DB : genpept-bctl] [DE: Bacillus subtilis (clone pED4) 
comG- (1,2,3,4, 5, 6, and 7) proteins incomG operon, complete cds . ] [LE:3405] [RE:3701] 
[DI: direct] >gp: [GI:dl 013214 :gl 303879] [LN: BACJH642] [AC : D84432 : D823 70] [PN:ComGC] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] 
[DB: genpept-bctl] [DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:161635] [RE:161931] [DI: direct] >gp : [GI :ell85739:g2 634905] [LN : BSUB0013 ] 
[AC:Z99116 :AL009126] [GN:comGC] [FN : exogenous DNA-binding (competence)] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 13 of 21) : 
from 2395261to 2613730.] [SP:P25955] [LE:161690] [RE:161986] [DI : complement] 

NT AA 

ORF Name NTID AAID — — , . score probab ility 
LengthLength c 



A?7g03000982_i070437_c3_i482 | |230 | [4002 |p^T~||297 ||727 ||6.8e-72 



Description 

sp: [LN:YPCP_BACSU] [AC:P54161] [GN:YPCP] [OR:BACILLUS SUBTILIS] [EC : 3 . 1 . 11 . - ] 
[DE : POTENTIAL 5 ' -3 » EXONUCLEASE , ] [SP:P54161] [DB : swissprot] >pir : [LN : H6 9933] 
[AC:H69933 ] [PN:5'-3' exonuclease homolog ypcP] [GN:ypcP ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI :gl256623] [LN : BACYACA] [AC:L77246] [PN : exodeoxyribonuclease] 
[GN:ypcP] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis (YAC10-9 
clone) DNA region between the serA andkdg loci.] [NT: 36.1% identity with 219 aa at the 
5' end of the] [LE:9565] [RE:10455] [DI:direct] >gp : [GI : ell83647 :g2634620] 
[LN:BSUB0012] [AC : Z99115 : AL009126] [GN:ypcP] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 12 of 21) : from 
2195541to 2409220.] [NT:similar to 5'-3' exonuclease] [SP:P54161] [LE:114697] 
[RE : 115587] [DI : complement] 

NT AA 

ORF Name NTID AAID — — . _ score probab ility 
LengthLength c -L - 



AI7503000982_10938927_i:2_492 | |231 | [4003 |[I31~| | 43 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ T — . , score probab ility 
LengthLength L 



A17503000982_10969050_c2_il93 | [ 232 | [ 4004 | p3^F] |S31 [ p.815 [ |3 . 5e- T8T 



Description 

sp : [LN : LEPA_BACSU] [AC:P37949] [GN : LEPA] [OR:BACILLUS SUBTILIS] [DE : GTP- BINDING PROTEIN 
LEPA] [SP:P37949] [DB : swissprot] >pir : [LN:G69649] [AC:G69649 ] [PN : GTP -binding protein 
lepA] [GN : lepA ] [CL:GTP-binding membrane protein lepA: translation elongation factor Tu 
homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013139 :gl303804] [LN: BACJH642] 
[AC:D84432 :D82370] [PN: YqeQ] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl)) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 86866] [RE: 88704] [DI: direct] 

>gp: [GI:e200671:gll22398] [LN : BSLEPORF] [AC:X91655] [GN: lepA] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis lepA and hemN genes.] [SP:P37949] [LE:128] [RE:1966] 
[DI:direct] >gp: [GI :ell83781:g2 634997] [LN: BSUB0014] [AC : Z99117 : AL009126] 
[PN: GTP -binding protein] [GN : lepA] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 14 of 21): from 2599451to 2812870.] 
[NT alternate gene name: yqxB, yqeQ] [SP:P37949] [LE:30726] [RE:32564] [DI : complement] 
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ORF Name 



NT ID 



AAID 



NT 



AA 



LenqthLength 



score probability 



[AI 7 503000982 11194067 c3 13 7 8 



] F n I | 4005 |[ 747 [I 248 |pT-||1.6e-40 



Description 

Sp : [LN : YQEM_BACSU] [AC:P54458] [GN : YQEM] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 28.3 
KD PROTEIN IN AROD- COMER INTERGENIC REGION] [SP:P54458] [DB : swissprot] >pir : [LN : A6 9952] 
[AC:A69952 ] [PN : conserved hypothetical protein yqeM] [GN:yqeM ] [CLrbioC homology] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013129 :gl303794] [LN: BACJH642] 
[AC:D84432 :D82370] [PN : YqeM] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 76770] [RE: 77513] [DI: direct] 

>gp: [GI:ell83791:g2635007] [LN:BSUB0014] [AC : Z99117 : AL009126] [GN:yqeM] [FN:unknown] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis complete genome (section 
14 of 21): from 2599451to 2812870.] [NT:similar to hypothetical proteins] [LE:41917] 
[RE : 42660] [DI : complement] 



ORF Name 



IAI7503000382 11203753 cl 585 



Description 
NO-HIT 



NT 



AA 



NTID AAID , — ^, „ — _ score probability 
LenqthLength c *- 



ORF Name 



|AI750300098a 11881313 tl 285 



Description 
NO-HIT 



NTID 



NT 



AA 



AAID , 



LengthLength 
|235 | [4007 |[T73 | [57 | 



score probability 



ORF Name 



IAI7503000982 12115552 ±2 354 



Description 
NO-HIT 



NTID 



„ TT ^ NT AA 

AAID — , T — , , score 

LengthLength 



probability 



236 | [4008 ||T23 | [40 | 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



AI7503000982 12187b0 Cl 996 



probability 
j [237 | [4009 [pi - 1|226 |p36 ||i.8e-30 



Description 

gp: [GI:g3211753] [LN : AF0522 08] [AC : AF0522 08 ] [PN : competence protein] [GN: celA] 
[OR: Streptococcus pneumoniae] [DB : genpept -bct2 ] [DE : Streptococcus pneumoniae competence 

protein (celA) and competenceprotein (celB) genes, complete cds; and unknown gene.] 
[LE:266] [RE: 916] [DI:direct] 



ORF Name 



NTID 



„ Tr% NT AA 

AAID — . T — , . score 

LengthLength 



probability 



IAI7503000982 1225553 c2 1270 



Description 
NO-HIT 



[4010 |praT~| |45 | 



ORF Name 



NTID 



par 



7503000582 1229750 c3 1419 



TUTT 



NT AA 

AAID 

LengthLength 

1458 1033" 



score 



probability 
2.3e-48 



Description 

gp: [GI:el363305:g4127534] [LN : BSAJ10954] [AC : AJ010954] [PN:arginine repressor] 
[GN:argR] [FN: ADN binding protein] [OR:Bacillus stearothermophilus] [DB : genpept- be tl] 
[DE: Bacillus stearothermophilus argR gene and partial recN gene.] [LE:196] [RE: 645] 
[DI : direct] 
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ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000982 1345752 13 780 



-| |240 | [4012 | pTT 



72TT 



probability 
] |J.Be-71 



Description 

sp: [LN:YQKF_BACSU] [AC:P54569] [GN : YQKF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 

OXIDOREDUCTASE IN ANSR-BMRU INTERGENIC REGION] [SP:P54569] [DB : swissprot] 

>pir : [LN:H69966] [AC:H69966 ] [PN : conserved hypothetical protein yqkF] [GN:yqkF ] 

[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013318 : gl303983] [LN : BACJH642] 

[AC:D84432 :D82370] [PN: YqkF] [OR:Bacillus subtilis] [SR:Bacillus subtilis 

(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept - be t 1 ] [DE:Bacillus subtilis DNA, 283 Kb 

region containing skin element.] [LE: 259404] [RE: 260324] [DI : complement] 

>gp: [GI:ell85631:g2634797] [LN : BSUB0013] [AC : Z99116 : AL009126] [GN:yqkF] [FN:unknown] 

[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 

13 of 21): from 2395261to 2613730.] [NT:similar to hypothetical proteins] [SP:P54569] 

[LE:63295] [RE:64215] [DI: direct] 



ORF Name 



E 



17303000582 1359450 ±2 544 



Description 
NO-HIT 



NTID AAID 



NT AA 
LengthLength 
| [4013 |[144 1 147 | 



score 



probability 



ORF Name 



NT 

NTID AAID . — 



AA 



LengthLength 



score probability 



|A17S030009Ba 13606S tl 22 7 



[4014 |p^uT"[|4&7 | [1201 ||4.0e-122 



Description 

pir: [LN:D70585] [AC:D70585 ] [PN:probable glyS protein] [GN:glyS ] [CL Mycoplasma 

genitalium glycine- -tRNA ligase] [OR : Mycobacterium tuberculosis] [DB:pir2] 

>gp: [GI:e315164:g2078043] [ LN : MTCY2 7 ] [AC : Z952 08 : AL123456] [PN:glyS] [GN:glyS] 

[OR : Mycobacterium tuberculosis] [DB : genpept -bet 1] [DE : Mycobacterium tuberculosis H37Rv 

complete genome; segment 104/162.] [NT:Rv2357c, (MTCY27 . 23-MTCY98 . 26 ) , len: 463 aa. 

glyS,] [LE:19] [RE: 1410] [DI : complement ] 



ORF Name 



NT AA 
NTID AAID — - — , , score 
LengthLength 



probability 



1AI7503000982 13674130 r3 703 



Description 
NO-HIT 



] P 43 1 1 4015 IP^Hl 42 | 



ORF Name 



NTID AAID — _ score 
LengthLength 



probability 



p7503000982_i3853500_i2_374 

Description 

NO-HIT 



j |244 j [4015 



I 



ST" 



135 



NT AA 

QRF Name ™ L e^thLe^th ^g probability 



AI7503000982_13875943__cl_ii22 | |245 | [ 401 7 | | 1488 | | 495 || 882 | p.6e-88 



Description 

pir: [LN:B69610] [AC : B69610 : JC5744 ] [PN : carboxy- terminal processing proteinase 
ctpA, : tail-specif ic endopeptidase Pre] [GN : ctpA ] [CL: carboxyl- terminal processing 
proteinase] [ORrBacillus subtilis] [EC : 3 . 4 . 99 . - ] [DB:pir2] >gp : [GI :g2529476] 
[LN:AF006665] [AC : AF006665] [PN:OrfRMl] [GN : orf RMl] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE: Bacillus subtilis 168 region at 182 min containing the cge 
genecluster . ] [NT: similar to the E. coli Pre and carboxyl -terminal] [LE: 22 886] 
[RE:24286] [DI:direct] >gp : [GI :g2415395] [LN: AF015775] [AC : AF015775] [PN : proteinase] 
[GN: ctpA] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis YodA (yodA), 
YodB (yodB) , YodC (yodC) , YodD(yodD), ABC-transporter (yodE) , permease (yodF) , 
proteinase (ctpA) , YodH (yodH), YodI (yodl) , carboxypeptidase (yodJ) , purinenucleoside 
phosphorylase (deoD) , YodL (yodL) , YodM (yodM) , YodN(yodN), YodO (yodO) , YodP (yodP) , 
acetylornitine deacetylase (argE) , butirate-acetoacetate CoA transferase (yodR) , 
butyrateacetoacetate-CoA transferase (yodS) , YodT (yodT) , CgeE (cgeE) , CgeD(cgeD), CgeC 
(cgeC) , CgeA (cgeA) , CgeB (cgeB) , YzxA (yzxA) ,UDP-glucose epimerase (yodU) , YodV 
(yodV) , and YodW (yodW) genes , complete cds; and YodZ (yodZ) gene, partial cds.] 
[NT: similar to Synechocystis sp. PCC6803] [LE:5846] [RE: 7246] [DI : complement] 
>gp: [GI:ell85430:g2634351] [LN : BSUB0011] [AC : Z99114 : AL009126] [PN : carboxy- terminal 
processing protease] [GN : ctpA] [OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus 
subtilis complete genome (section 11 of 21): from 2000171to 2207900.] [NT alternate 
gene name: yzbD] [LE: 130976] [RE: 132376] [DI : complement] 

NT AA 

ORF Name NT ID AAID — , — , score probability 
LengthLength c 



AlV5030009aa_14097011_c3_i4il | |246 | [ 4018 |p2~ || 203 | p2 ||3.6e-50 



Description 

sp: [LN : ARGJ_BACST] [AC:Q07908] [GN : ARG J] [OR: BACILLUS S TE AROTHE RMO PH I LUS ] 
[EC:2.3.1.35:2.3.1.1] [DE : ACETYLTRANSFERASE , (N-ACETYLGLUTAMATE SYNTHASE) (AGS)] 
[SP:Q07908] [DB : swissprot] >pir : [LN: 139766] [AC:I39766 ] [PN : glutamate 

N-acetyltransferase, ] [GN:argJ ] [CL : glutamate N-acetyltransf erase argJ] [OR:Bacillus 

stearothermophilus] [EC : 2 . 3 . 1 . 35] [DB:pir2] >gp : [GI:g304135] [LN : B AC ACETYL] [AC:L06036] 
[PN: ornithine acetyl transferase] [GN:argJ] [OR: Bacillus stearothermophilus] 
[SR:Bacillus stearothermophilus (strain NCIB 8224) DNA] [DB : genpept-bctl] [EC : 2 . 3 . 1 . 35] 
[DE: Bacillus stearothermophilus ornithine acetyl transferase (argJ) andacetylglutamate 

kinase (argB) genes, complete cds's, argC gene, 3»end, and argD gene, 5» end.] [NT:also 

bears acetyl -CoA : L-glutamate] [LE:902] [RE:2134] [DI:direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

|AI75030009fl2_i425963i_ci_S67 | |247 | | 40i9 |[TUFr|| 355 | p7HT] p . 3e-1^3 | 

Description 

Sp : [LN : QUEA_BACSU] [AC:O32054] [GN : QUE A] [OR:BACILLUS SUBTILIS] [EC : 5 . -.-.-] 
[DE: (QUEUOSINE BIOSYNTHESIS PROTEIN QUEA) ] [SP:O32054] [DB : swissprot] >pir : [LN : A69688] 
[AC:A69688 ] [ PN : S - adenosylme thionine : tRNA ribosyltransf erase- isomerase , :queuosine 
biosynthesis-related protein queA] [GN : que A ] [CL : S-adenosylmethionine : tRNA 
ribosyltransf erase- isomerase] [OR:Bacillus subtilis] [EC: 5.-.-.-] [DB:pir2] 
>gp: [GI :ell84021:g263 5237] [LN : BSUB0015] [AC : Z99118 : AL009126] [PN: S- adenosylme thionine 
tRNA ribosyltransf erase] [GN : queA] [FN : queuosine biosynthesis] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 15 of 21): from 
2795131to 3013540.] [SP:O32054] [LE:38030] [RE:39058] [DI : complement] 
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ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



|Al 7 503000982_14460932_ll_322 



] | 248 | | 4020 || 177 || ba | p25 | [ 



probability 
8 . 4e-19 



Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PNrunknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 0RF1 and 0RF2 genes, completecds . ] [NT :0RF1] 

[LErllOl] [RE: 1922] [D I : complement] 



ORF Name 



NTID 



_ _ NT AA 

AAID — _ — , , score 

LengthLength 



probability 



^7503000982 14507827 tl 87 



][ 



Description 

NO-HIT 

i — - _ = 

ORF Name 



[4021 |p^~| |49 | 



NTID AAID 



NT 



AA 



LengthLength 



score 



IA17503000982 14508567 c3 1357 



] [4022 |p75~||9i |[T5T" 



probability 
|2.4e-14 



Description 

pir: [LN:E69972] [AC:E69972 ] [PN: conserved hypothetical protein yrbF] [GNiyrbF ] 
[CL:yajC protein] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84019 :g2635235] 
[LN:BSUB0015] [AC : Z99118 : AL009126] [GN:yrbF] [FNrunknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 15 of 21) : from 
2795131to 3013540.] [NT: similar to hypothetical proteins] [LE: 36555] [RE: 36824] 
[DI : complement] 



ORF Name 



NTID 



„ TO NT AA 

AAID — . T — , . score 

LengthLength 



IAI7503000982 14881587 c2 1226 





251 




4023 


993 



probability 
I2.7e-148 



Description 

sp : [LN : GLK_STAXY] [AC:Q56198] [GN : GLKA] [OR : STAPHYLOCOCCUS XYLOSUS] [EC:2.7.1.2] 
[DE : GLUCOKINASE , (GLUCOSE KINASE)] [SP:Q56198] [DB : swissprot] >pir : [LN : S52352 ] 
[AC:S52352 ] [PN:glucose kinase] [CL:glucose kinase : glucose kinase homology] 
[OR staphylococcus xylosus] [DB:pir2] >gp : [GI : g666116] [LN: SXGKG2] [AC:X84332] 
[PN: glucose kinase] [GN:glkA] [OR : Staphylococcus xylosus] [DB : genpept-bctl] 
[DE:S. xylosus glucose kinase gene.] [SP:Q56198] [LE:973] [RE: 1959] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



F 



17503000982 15031535 t2 658 



Description 
NO-HIT 



] I 252 1 |4024 [p3^~||5i | 



ORF Name 



^17503000982 15728386 c3 1355 



NTID AAID 



NT 



AA 



LengthLength 



score 



] |253 | [4025 |pTTTT]|380 |p3*2"| 



probability 
|2.ye-ib8 " 



Description 

sp : [LN : TGT_BACSU] [AC:O32053] [GN : TGT] [OR:BACILLUS SUBTILIS] [EC : 2 . 4 . 2 . 2 9] 
[DE : TRANS GLYCOSYLASE) (GUANINE INSERTION ENZYME) ] [SP:O32053] [DB : swissprot] 
>pir: [LN:B69722] [AC:B69722 ] [PN:queuine tRNA-ribosyltransf erase , ] [GN:tgt ] 
[CL:queuine tRNA-ribosyl transferase] [OR:Bacillus subtilis] [EC : 2 . 4 . 2 . 29] [DB:pir2] 
>gp: [GI:ell84020:g2635236] [LN: BSUB0015] [AC : Z99118 : AL00912 6] [PN : tRNA- guanine 
transglycosylase] [GN:tgt] [FN : queuosine biosynthesis] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [EC : 2 . 4 . 2 . 29] [DE:Bacillus subtilis complete genome (section 15 of 
21): from 2795131to 3013540.] [SP:O32053] [LE:36858] [RE:38003] [DI : complement] 
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ORF Name 



NT ID 



NT AA 
AAID — , — ^ score 
LengthLength 



probability 



E 



17503000982 16205035 cl 1035 



][ 



254 



[4025 p^" 



Description 
NO-HIT 



NTID 



A7VTr , NT AA 
AAID — , , _ — , . score 
LengthLength 



[2inr 



[4027 |[^~ 



probability 
9.0e-38 



ORF Name 

[AI7503000552_iggi0088_c2_i25i 
Description 

sp: [LN:YPAA_BACSU] [AC:P50726] [GN: YPAA] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 20.5 
KD PROTEIN IN SERA-FER INTERGENIC REGION] [SP:P50726] [DB : swissprot] >pir : [LN : E69932] 
[AC:E69932 ] [PN: hypothetical protein ypaA] [GNrypaA ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI :ell85574 :g2634740] [LN : BSUB0013 ] [AC : Z99116 : AL009126] [GN : ypaA] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [SP:P50726] [LE:13991] [RE:14563] [DI : complement] 
>gp: [GI:gll46197] [LN : BACSERA] [AC:L47648] [GNrypaA] [OR:Bacillus subtilis] 
[DB : genpept -bet 2 ] [DEtBacillus subtilis phosphoglycerate dehydrogenase (serA) , 
ypaA, f erredoxin (fer) , ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glut amate dehydrogenase 
(ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, ypfB, cytidine 
monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphC, NAD+ 

dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes, complete cds . ] 
[NT:putative] [LE:2114] [RE: 2686] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
Length Length 



probability 



E 



17503000982 16828175 c3 1395 



][ 



[4028 [pre - 1|201 ||1021 ||4.8e-103 



Description 

gp: [GI:g4325247] [LN : AF121672 ] [AC : AF121672] [PN : superoxide dismutase SodA] [GN : sodA] 
[OR staphylococcus aureus] [DB : genpept -bct2] [DE : Staphylococcus aureus superoxide 

dismutase SodA (sodA) gene , complete cds.] [NT : manganese -dependent] [LE:246] [RE:845] 
[DI: direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



IAI7503000982 15923383 c2 1252 



] p 57 1 [4029 |p^~[|112 |p5" 



probability 
3.1e-ll 



Description 

Sp: [LN: YQJQ_BACSU] [AC:P54554] [GN: YQJQ] [OR: BACILLUS SUBTILIS] [EC:1. -.-.-] [DE:(EC 
1. -.-.-)] [SP:P54554] [DB : swissprot] >pir : [LN : A6 9965] [AC:A69965 ] [PN:ketoacyl 
reductase homolog yqjQ] [GN:yqjQ ] [CL : short -chain alcohol dehydrogenase homology] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013303 :gl303968] [LN : BACJH642] 
[AC:D84432 :D82370] [PN: YqjQ] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE : 247869] [RE: 248648] [DI: direct] 

>gp: [GI:ell85647:g2634813] [LN: BSUB0013] [AC : Z99116 : AL009126] [GN:yqjQ] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
13 of 21) : from 2395261to 2613730.] [NT:similar to ketoacyl reductase] [SP:P54554] 
[LE: 74971] [RE: 75750] [DI : complement] 
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NT AA 

ORFJTame NTID AA1D Le ~ thLe ~ th sg°£e probability 



AI75030009aa_lV002217_c2_iai>4 | |258 | [ 4030 |pF^[| lS4 | p^7~[| 4 . 7e-57 



Description 

sp: [LN:FUR2_BACSU] [AC:P54574] [GN: YQKL] [OR:BACILLUS SUBTILIS] [DE : FERRIC UPTAKE 
REGULATION PROTEIN HOMOLOG 2] [SP:P54574] [DB : swissprot] >pir : [LN: E69967] [AC:E69967 ] 
[PN: transcription regulator Fur family homolog yqkL] [GN:yqkL ] [CL: ferric uptake 
regulator] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013328 :gl303993] [LN: BACJH642] 
[AC:D84432 :D82370] [PN: YqkL] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 269361] [RE: 269810] [DI: direct] 

>gp: [GI:ell85621:g2634787] [LN:BSUB0013] [AC : Z99116 : AL009126] [GN:yqkL] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [NT:similar to transcriptional regulator (Fur 
family)] [SP:P54574] [LE: 53809] [RE: 54258] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ „ — , , score prob ability 

Length Length 



|mS03000$fl2jL87633_clJ.074 | [259 | |403i |p7^~ [ | 323 | p7T7~[ p . 8e-12 

Description 

Sp : [LN : YPBB_BACSU] [AC:P50728] [GN: YPBB] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 40.7 
KD PROTEIN IN FER-RECQ INTERGENIC REGION] [SP:P50728] [DB : swissprot] >pir : [LN: F69932] 
[AC:F69932 ] [PN : hypothetical protein ypbB] [GN:ypbB ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell83748:g2634721] [LN: BSUB0012] [AC : Z99115 : AL009126] [GN : ypbB] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 

12 of 21): from 2195541to 2409220.] [SP:P50728] [LE:212099] [RE:213157] [DI : complement] 
>gp: [GI:ell85572:g2634738] [LN:BSUB0013] [AC : Z99116 : AL009126] [GN : ypbB] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 

13 of 21): from 2395261to 2613730.] [SP:P50728] [LE:12379] [RE:13437] [DI : complement] 
>gp: [GI:gll46199] [LN :BACSERA] [AC:L47648] [GN:ypbB] [OR:Bacillus subtilis] 
[DB:genpept-bct2] [DE:Bacillus subtilis phosphoglycerate dehydrogenase (serA) , 

ypaA, f erredoxin (fer) , ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glutamate dehydrogenase 
(ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, ypfB, cytidine 
monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphc, NAD+ 

dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes , complete cds . ] 
[NT:putative] [LE:3240] [RE:4298] [DI: direct] 



ORF Name 



NTID 



IAI7S03000982 189203 c2 1199 



][ 



NT AA 

AAID , , 

LengthLength 

[4032 | [7^ | 



score 



probability 
3.9e-53 



Description 

sp: [LN: YQEU__BACSU] [AC:P54461] [GN: YQEU] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 28.8 
KD PROTEIN IN DNAJ-RPSU INTEREGENIC REGION] [SP:P54461] [DB : swissprot] >pir: [LN:D69952] 
[AC:D69952 ] [PN : conserved hypothetical protein yqeU] [GN:yqeU ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl013146 :gl303811] [LN: BACJH642] [AC : D84432 : D82370] [PN:YqeU] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl)) DNA] 
[DB: genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:95803] [RE:96573] [DIrdirect] >gp : [GI:ell837 74 :g2 634990] [LN : BSUB0014] 
[AC:Z99117:AL009126] [GN:yqeU] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 14 of 21): from 2599451to 2812870.] 
[NT: similar to hypothetical proteins] [SP:P54461] [LE: 22857] [RE: 23627] [DI : complement] 
>gp: [GI:dl012752:gl890060] [LN:D83717] [AC:D83717] [PN:YqeU] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain : JH642 (trpC2 pheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus 
subtilis DNA for DnaJ, YqeT, YqeU, YqeV, YqeW, YqeX, YqeY, complete and partial cds.] 
[NT: Similar to 26.9 kDa protein (YggJ) of E. coli] [LE:1334] [RE: 2104] [DI: direct] 
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NT AA 

ORF Name NTID AAID Le ~ thLen ^ th score probability 



AI 7 5030009 82J.954093 1_c2_119i [ | 261 | [ 4033 |p?u~| |739 ||gg7—|| 2 ■ Oe-63 



Description 

sp: [LN:CME3_BACSU] [AC:P39695] [GN : COMEC : COME3] [OR : BACILLUS SUBTILIS] [DE : COME OPERON 
PROTEIN 3] [SP:P39695] [DB : swissprot ] >pir : [LN : S39865] [AC : S39865 : E69602 ] [PNrlate 
competence protein 3 (comE operon) ] [GN:comEC ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:g289262] [ LN : BACCOME ] [AC:L15202] [OR:Bacillus subtilis] [SRrBacillus subtilis 
DNA] [DB:genpept-bct2] [DE: Bacillus subtilis comE operon encoding ORF1, ORF2 , ORF3 
andReverse -ORF genes, complete cds . ] [LE:2603] [RE:4933] [DIrdirect] 



NT AA 

ORF Name NTID AAID — __ — . score probability 
LengthLength 



Ai:75030009S2_i55675aa_c3_i456 | | [ 4034 |[5ig~|| i70 |pT~|| 6 ■ Be-42 



Description 

sp: [LN:PTGA_BACST] [AC:P42015] [GN: PTSG] [OR:BACILLUS STEAROTHERMOPHILUS] [EC : 2 . 7 . 1 . 69] 
[DE : COMPONENT) , (EII-GLC / EIII-GLC) (FRAGMENT)] [SP:P42015] [DB : swissprot] 
>gp: [GI :g529001] [LN : BSU12340] [AC:U12340] [PN:PTS glucose- specif ic permease] 
[GN :ptsG 1 ] [OR:Bacillus stearothermophilus] [DB :genpept-bctl] [DE:Bacillus 
stearothermophilus XL-65-6 phosphoenolpyruvate-dependentphosphotransf erase system 
glucose-specific permease (ptsG') gene, partial cds, HPr (ptsH) , enzyme I (ptsl) , and 
PtsT (ptsT) genes , complete cds, and wall associated protein precursor (wapA') 
gene, complete cds.] [NT : thermophilic, cytoplasmic protein] [LE:<1] [RE:976] [DI:direct] 

NT AA 

ORF Name NTID AAID — — , , score probab ility 
LengthLength c £ - 



IAI7503000982 19567812 c3 1495 



|253 | [4035 |p5~|[154 ||g7B~||l. 7e-4S 



Description 

sp: [LN:YPPQ_BACSU] [AC:P54155] [GN: YPPQ] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 16.6 
KD PROTEIN IN ILVA 3 1 REGION] [SP:P54155] [DB : swissprot] >pir : [LN : F69940] [AC:F69940 ] 
[PN: transcription regulator PilB family homolog yppQ] [GN : yppQ ] [CL :hypthetical 
protein YCL033c] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl256654] [LN : BACYACA] 
[AC:L77246] [GN: yppQ] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
(YAC10-9 clone) DNA region between the serA andkdg loci.] [NT: 54. 8% identity with 
Neisseria gonorrhoeae] [LE:33923] [RE:34354] [DIrdirect] >gp : [GI : ell83615 :g2634588] 
[LN:BSUB0012] [AC : Z99115 : AL009126] [GN:yppQ] [FN:unknown] [ORrBacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 12 of 21) : from 
2195541to 2409220.] [NT: similar to transcriptional regulator (PilB family)] [SP:P54155] 
[LE : 90798] [RE : 91229] [DI : complement] 

NT AA 

ORF Name NTID AAID m — — t score probability 
LengthLength c L - 



AI7503000992_i9500i7_t3_809 | |254 | [4036 |p"^~| pre 



Description 
NO-HIT 
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NT AA 

ORFName NTID AAID ^~ thLe ^ th g2re probability 



AI750300098:QL9B04a3aj:2J385 | | [ 4037 ||7 17 | [ 238 | | b49 || B,0e-53 



Description 

sp: [LN:YPDP_BACSU] [AC:P54163] [GN : YPDP] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 25.7 
KD PROTEIN IN BCSA-DEGR INTERGENIC REGION] [SP:P54163] [DB : swissprot ] >pir : [LN : C6 9934] 
[AC:C69934 ] [PN : conserved hypothetical protein ypdP] [GN:ypdP ] [CL : Archaeoglobus 
fulgidus conserved hypothetical protein AF2110] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:gl256625] [LN : BACYACA] [AC:L77246] [GN:ypdP] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis (YAC10-9 clone) DNA region between the serA 
andkdg loci.] [NT : putative] [LE: 11031] [RE: 11720] [DI : complement] 

>gp: [GI:ell83645:g2634618] [LN: BSUB0012] [AC : Z99115 : AL009126] [GNrypdP] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
12 of 21): from 2195541to 2409220.] [NT:similar to hypothetical proteins] [SP:P54163] 
[LE: 113432] [RE: 114121] [DI: direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



&I75030009fl2_20006377_c3_i352 | |266 | [ 4038 |p2~ ] |243 | | 432 || i.2e-40 



Description 

Sp : [LN : YQFN_BACSU] [AC:P54471] [GN : YQFN] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 23.7 
KD PROTEIN IN CCCA-SODA INTERGENIC REGION] [SP:P54471] [DB : swissprot] >pir : [LN : H69953] 
[AC:H69953 ] [PN : conserved hypothetical protein yqfN] [GN:yqfN ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl013170 :gl303835] [LN : BACJH642 ] [AC : D84432 : D82370] [PN : YqfN] 
[OR:Bacillus subtilis] [SRrBacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:120338] [RE:120988] [DI:direct] >gp : [GI : ell85785 : g2634951] [LN : BSUB0013 ] 
[AC:Z99116 :AL009126] [GN:yqfN] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT:similar to hypothetical proteins] [SP:P54471] [LE:202632] [RE:203282] 
[DI : complement] 



NT AA 

ORF Name NTID AAID — , — , , score probabilit y 
LengthLength ^ L - 



A17b0a000582_20035967_cl_1052 | |267 | [4039 [[TuTT] | 352 ||922 | | 1.5e-92 



Description 

pir: [LN:H69636] [AC:H69636 ] [PN : glycerol -3 -phosphate dehydrogenase (NAD+) , gpsA] 
[GN:gpsA ] [CL: glycerol -3 -phosphate dehydrogenase] [OR:Bacillus subtilis] [EC:1.1.1.8] 
[DB:pir2] >gp: [GI : el 183 72 8 :g2 634701] [LN :BSUB0012] [AC : Z99115 : AL009126] 

[PN:NAD(P)H-dependent glycerol -3 -phosphate] [GN:gpsA] [FN: synthesis of the sn-glycerol 
3-phosphate] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 1 . 1 . 1 . 94] [DE:Bacillus 
subtilis complete genome (section 12 of 21): from 2195541to 2409220.] [SP:P46919] 
[LE:192848] [RE:193885] [DI : complement] >gp : [GI :gll46220] [LN : BACSERA] [AC:L47648] 
[PN:NAD+ dependent glycerol -3 -phosphate] [GN:glyC] [OR:Bacillus subtilis] 
[DB:genpept-bct2] [EC : 1 . 1 . 1 . 94] [DE:Bacillus subtilis phosphoglycerate dehydrogenase 
(serA) , ypaA, f erredoxin (fer) , ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glutamate 
dehydrogenase (ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, 
ypfB, cytidine monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphC, NAD+ 
dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes , complete cds . ] 
[LE:22512] [RE:23549] [DI: direct] 



ORF Name 



NT AA 

NTID AAID — — , . score prob ability 
LengtnLengtn * 



IAlV503000a82 200S4642 t2 341 



268 | [4040 [p3~ ||45 | 



Description 
NO-HIT 
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ORF Name 



U 7 S03000982 203832b F2 381 



Description 
NO-HIT 



NT AA 
NTID AAID — _ — score 
LengthLenqth 



probability 



] p 59 1 [4041 |[rffTT~| [59 | 



ORF Name 



p7g03000982_204857i2_c3_i498 



Description 
NO-HIT 



NTID 



TUT 



^ m NT AA 

AAID — . _ — , . score 

LengthLenqth 



probability 



[4042 |J575 [ |Tgu~ 



ORF Name 



NTID 



w7VT ^ NT AA 

AAID — — . , score 

Length Length 



probability 



a7503000982 20503437 c3 1455 



][ 



TTT 



[4043 ||X?3 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — — score 
Length Length 



probability 



AI7503000982 20507937 c2 1280 



] [272 | [4044 |pTT||438 [11823 114,96-188 



Description 

sp: [LN:SYN_BACSU] [AC:P39772] [GN:ASNS] [OR:BACILLUS SUBTILIS] [EC : 6 . 1 . 1 . 22] 
[DE : (ASNRS) ] [SP:P39772] [DB : swissprot] >pir: [LN:B69591] [AC : B69591 : 140523 ] 
[PN:asparagine--tRNA ligase, asnS : asparaginyl-tRNA synthetase asnS] [GN:asnS ] 
[CL: lysine- -tRNA ligase] [OR:Bacillus subtilis] [EC : 6 . 1 . 1 . 22] [DB:pir2] 
>gp: [GI:gll46247] [LN : BACYPIA] [AC:L47709] [PN : asparaginyl - tRNA synthetase] [GN:asnS] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 6 . 1 . 1 . 22] [DErBacillus subtilis (clone 
YAC15-6B) ypiABF genes, qcrABC genes , ypjABCDEFGHI genes, birA gene, panBCD genes, dinG 
gene, ypmB gene,aspB gene, asnS gene, dnaD gene, nth gene and ypoC gene, 
completecds ' s . ] [NT: 41.1% of identity to the Escherichia coli] [LE : 20449] [RE: 21741] 
[DI:direct] >gp: [GI : ell83681 :g2634654] [LN : BSUB0012] [AC : Z99115 : AL009126] 
[PN: asparaginyl-tRNA synthetase] [GN:asnS] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[EC:6.1.1.22] [DE:Bacillus subtilis complete genome (section 12 of 21): from 2195541to 
2409220.] [SP:P39772] [LE:149926] [RE:151218] [D I : complement] 
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NT AA 

ORF Name NT ID AAID Le ^ t fr Le ^ t h score probability 

|A17503000982_20bi713b_c3_14S 7 | [273 | [ 4045 [[ 315 | [ 104 || 351 ||4.1e-33 | 

Description 

sp: [LN:DBH_BACST] [AC : P02346 : P08822 ] [GN : HBS : HBSU] [OR:BACILLUS 

STEAROTHERMOPHILUS: BACILLUS CALDOLYT I CUS : BACILLUS CALDOTENAX] [DE : DNA-BINDING PROTEIN 
II (HB) (HU)] [SP:P02346 :P08822] [DB : swissprot] >pir: [LN:DNBS2F] 

[AC : JC1205 : A02690 : JC2509 ] [PN : DNA- binding protein HU: DNA- binding protein II] [GN:hup ] 
[CL :bacterial DNA-binding protein] [OR:Bacillus stearothermophilus] [DBrpirl] 

>pir: [LN: JC1207] [AC:JC1207 ] [PN : DNA-binding protein HU] [GNrhup ] [CL : bacterial 

DNA-binding protein] [OR:Bacillus caldolyticus] [DB:pir2] >pir: [LN:JC1206] [AC:JC1206 ] 
[PN: DNA-binding protein HU] [GN:hup ] [CL : bacterial DNA-binding protein] [OR: Bacillus 

caldotenax] [DB:pir2] >gp : [GI :dl007851 :gl065992] [LN : BACDBPHU] [AC:D38080] [ PN : DNA 

binding protein HU] [OR: Bacillus stearothermophilus] [SR: Bacillus stearothermophilus 
(strain:1503) DNA] [DB :genpept-bctl] [DE:Bacillus stearothermophilus gene for DNA 

binding protein HU, complete cds . ] [LE:13] [RE:285] [DI:direct] >gp : [GI :gl43065] 
[LN : BACHUBl] [AC:M73500] [PN:hubst] [GN:hubst] [OR: Bacillus stearothermophilus] 
[SR:Bacillus stearothermophilus DNA] [DB :genpept-bctl] [DE : B . stearothermophilus hubst 

gene, complete cds . ] [LE:1] [RE:>270] [DI:direct] >gp : [GI :gl43067] [LN: BACHUB2] 
[AC:M73501] [PN:hubcalx] [GN:hubcalx] [OR:Bacillus caldotenax] [SR:Bacillus caldotenax 

DNA] [DB:genpept-bctl] [DE : B . caldotenax hubcalx gene, 5' end.] [LE:1] [RE:>270] 
[DI:direct] >gp : [GI : gl43069] [LN :BACHUB3] [AC :M73502] [PN:hubcald] [GN:hubcald] 
[OR:Bacillus caldolyticus] [SR:Bacillus caldolyticus DNA] [DB :genpept-bctl] 
[DE:B. caldolyticus hubcald gene, 5' end.] [LE:1] [RE:>270] [DI:direct] 

NT AA 

ORF Name NT ID AAID — — , , score probability 
LengthLength c L ~ 



AI7S030009B2_20S87B36J::1._4 | [274 | [ 4045 |p7T~| [209 || 241 |pTT e-20 | 



Description 

sp: [LN:XPAC_BACSU] [AC:P37467] [GN:XPAC] [OR:BACILLUS SUBTILIS] [DE:XPAC PROTEIN] 
[SP:P37467] [DB : swissprot] >pir : [LN : S2 7526] [AC : S2 7526 : S66055 : C69734 ] 

[PN: 5-bromo-4-chloroindolyl phosphate hydrolysis protein xpaC:xpaC protein] [GN:xpaC ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl005803 : g467415] [LN:BAC180K] [AC:D26185] 
[PN: hydrolysis of 5-bromo-4-chloroindolyl phosphate] [GN:xpaC] [OR: Bacillus subtilis] 
[SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB :genpept-bctl] [DE : B . 

subtilis DNA, 180 kilobase region of replication origin.] [LE: 99445] [RE: 100059] 
[DI: direct] >gp : [GI :gl43830] [LN:BACXPAC] [AC:M96156] [GN:xpaC] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain W168) DNA] [DB : genpept - bet 1 ] [DE:Bacillus subtilis ORF1 

and xpaC gene, complete cds's; ssrRNA gene, 3' end; ORF3C 5' end.] [LE:532] [RE:1146] 
[DI:direct] >gp: [GI : el 181958 :g2 6322 92] [LN: BSUB0001] [AC : Z99104 : AL009126] [GN:xpaC] 
[FN: hydrolysis of 5-bromo 4-chloroindolyl phosphate] [OR: Bacillus subtilis] 
[DB : genpept- bet 1] [DE:Bacillus subtilis complete genome (section 1 of 21) : from 1 

to213080.] [SP:P37467] [LE:35843] [RE:36457] [DI: direct] 

NT AA 

ORF Name NT ID AAID — ^,_ T — , , score probabil ity 
LengthLength ^ 



|AI7S03000982_20S94688_c2_i33i | [275 | [ 4047 || 172 B"| [ 574 [ p.26 ||6.2e-08 — | 

Description 

Sp : [LN : LPLA_BACSU] [AC:P37966] [GN: LPLA] [OR:BACILLUS SUBTILIS] [DE : LIPOPROTEIN LPLA 
PRECURSOR] [SP:P37966] [DB : swissprot] >pir : [LN: 139876] [AC : 139876 :H69652 ] 
[PN: lipoprotein lplA:lysis protein lplA] [GN:lplA ] [CL:Bacillus subtilis lipoprotein 
lplA] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82689 :g2633023] [LN : BSUB0004] 
[AC:Z99107:AL009126] [PN : lipoprotein] [GNrlplA] [OR:Bacillus subtilis] 

[DB : genpept -bet 1] [DE:Bacillus subtilis complete genome (section 4 of 21) : from 600701 
to813890.] [SP:P37966] [LE:178337] [RE:179845] [DI:direct] >gp : [GI : g4312 72 ] 
[LN : BACLPLA] [AC:L03376] [PN:lysis protein] [GN:lplA] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (st rain Marburg 168) DNA] [DB : genpept -be t2] [DE: Bacillus subtilis 
lysis protein (lplA) gene, complete cds.] [LE:482] [RE:1990] [DI:direct] 
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ORF Name 



NTID 



score 



11 7 503000932 217755577 cl 1135 



][ 



NT AA 
LengthLenqth 
[4045 ||774 ||Z57 |pTT 



probability 
p.be-26 



Description 

gp: [GI:g4981613] [LN: AE001767] [AC : AE001767 : AE000512] [PN : transcriptional regulator, 
DeoR family] [GN : TM1069] [OR : Thermo toga maritima] [DB : genpept-bct2] [DE : Thermo toga 
maritima section 79 of 136 of the complete genome.] [NT: similar to GB:AL009126 percent 
identity: 55.02;] [LE:1578] [RE: 2336] [DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID — , — 



LengthLength 



score 



|AI7503000982_2079i068__c3_i370 | |277 | [4049 ||273 1 190 | [7T 



probability 
T5TTJ5I 



Description 

gp: [GI:g294060] [LN : PAPMPL146A] [AC:L06467] [PN:major latex protein] [GN:MLP146] 
[OR:Papaver somniferum] [SR:Papaver somniferum (strain UNL186) (library: EMBL) DNA] 
[DB:genpept-plnl] [DE : Papaver somniferum major latex protein (MLP146) gene, complete 
cds.] [LE: 963: 1290] [RE : 1167 : 1564] [DI : direct Join] 



ORF Name 



NT AA 
NTID AAID — „ — _ score 
LengthLength 



probability 



[AI7503000982 20876263 tl 159 



Description 
NO-HIT 



2 |278 | |4050 |pT7T | 



ORF Name 



NTID 



NT 



AA 



AAID , — , „ — , score probability 
LengthLength L - 



IAI7503000982 2148387 cl 1089 



"j |279 | [4051 |p73~||425 | [1794 ||5 . 8e-185 



Description 

Sp : [LN : AROC_STAAU] [AC:Q59803] [GN: AROC] [OR : STAPHYLOCOCCUS AUREUS] [EC:4.6.1.4] 
[DE : PHOSPHOLYASE) ] [SP:Q59803] [DB : swissprot] >gp : [GI:g987498] [LN : SAU3 1979] 
[AC:U31979] [PN : chorismate synthase] [GN:aroC] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [EC: 4. 6. 1.4] [DE : Staphylococcus aureus chorismate synthase (aroC) and 
nucleosidediphosphate kinase (ndk) genes, complete cds, dehydroauinatesynthase (aroB) 
and geranylgeranyl pyrophosphate synthetase homolog (gerCC) genes, partial cds.] 
[NT: 5-enolpyruvylshikimate 3-phosphate phospho- lyase] [LE:1142] [RE: 2308] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



^ — ^, „ — L1 score prob ability 
LengthLength c *- 



p7503000982^21501550_cl_1127 | |280 | [ 4052 | pir^] |37i ||Sgg-|| l . 0e-100 

Description 

gp: [GI:g3688811] [LN : AF084104] [AC : AF084104] [PN:maltose transportor ATP-binding 
protein] [GN:malK] [OR:Bacillus firmus] [DB : genpept -bet 2 ] [DE:Bacillus firmus AcsA 

(acsA) gene, partial cds; SspA (sspA) , hypothetical protein, maltose transportor 
ATP-binding protein (malK) , leucine-rich protein transcriptional regulator 

(IrpR) , hypothetical proteins, ABC transporter ATP-binding protein (natC) ,NatA (natA) , 
NatB (natB) , and hypothetical protein genes, completecds; and SpoIIIJ (spoIIIJ) gene, 
partial cds.] [NT :MalK; Orf4; similar to MsmX from Bacillus subtilis,] [LE:2390] 

[RE : 3490] [DI : complement] 



ORF Name 



NT 



AA 



NTID AAID — — , - score probability 
LengthLength c 



AI7503000982_2Ib2<!>S62_c2_1206 

Description 

NO-HIT 



j |28i | [4053 |[TT3~~||45 | 
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NT AA 

ORF_Name NTID AAID Le ~ thLe ~ th g^ probability 



AI7503000982_21b31627_c3_1483 | | 282 | [ 4054 ||132 ~| |4J | 



Description 
NO-HIT 



ORF Name 



NTID 



E 



17503000982 21537962 ci 1093 



„ Tn NT AA 
AAID t — — ^ 
LengthLenqth 



score 



TUT 



JET 



probability 
i.ie-32 



Description 

sp: [LN:YPJA_BACSU] [AC:P54392] [GNrYPJA] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 21.3 
KD PROTEIN IN QCRC-DAPB INTERGENIC REGION] [SP:P54392] [DB : swissprot] >pir : [LN : A69937] 

[AC:A69937 ] [PN : hypothetical protein ypjA] [GNiypjA ] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:gll46230] [LN : BACYPIA] [AC:L47709] [GN:ypjA] [FN : hypothetical] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DErBacillus subtilis (clone YAC15-6B) ypiABF genes, qcrABC 
genes , ypj ABCDEFGHI genes, birA gene, panBCD genes, dinG gene, ypmB gene,aspB gene, asnS 
gene, dnaD gene, nth gene and ypoC gene, completecds 1 s . ] [NT : puta t i ve ] [LE:5002] 

[RE:5559] [Dlidirect] >gp : [GI : ell83698 :g2634671] [LN:BSUB0012] [AC : Z99115 : AL009126] 

[GNrypjA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 12 of 21): from 2195541to 2409220.] [SP:P54392] [LE:166108] 

[RE : 166665] [DI : complement] 



ORF Name 



NTID AAID 



NT 



AA 



Length Length 



score probability 



|Al7503000982_21568762_clJ.0i3 | [284 | | 40b6 | |1005 1 1334 | [992 || 5.6e-100 

Description 

gp: [GI:el356351:g3947511] [LN:BMAJ4829] [AC : AJ22482 9] [GN : ORF4] [OR:Bacillus 
megaterium] [DB : genpept-bctl] [DErBacillus megaterium DSM319 spoIV operon, 5' 
region, 3' flanking region.] [LE:3056] [RE:4030] [DI:direct] 



flanking 



ORF Name 



_ _ _ _ _~ NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



lAl7b03000982 2160032b tl 228 



2 |285 [ [4057 |4g | 



Description 
NO-HIT 

ORF Name 



NTID 



— _ _— NT AA 

AAID — — score 

LengthLength 



|Al7b03000982 21640636 tl 136 



14058 11276 I m 



probability 
|1.2e-33 



Description 

pir : [LN:D69621] [AC:D69621 ] [PN : f erredoxin fer] [GN:fer ] [CL : f erredoxin 
2 [4Fe-4S] : ferredoxin 2 [4Fe-4S] homology] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:ell83749:g2634722] [LN : BSUB0012] [AC : Z99115 : AL009126] [PN: ferredoxin] [GN:fer] 
[ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis complete genome (section 
12 of 21): from 2195541to 2409220.] [NT : alternate gene name: ypbA] [SP:P50727] 
[LE:213423] [RE:213671] [DI:direct] >gp : [GI :ell85573:g2 634739] [LN: BSUB0013] 
[AC:Z99116 :AL009126] [PN : ferredoxin] [GN:fer] [ORrBacillus subtilis] [DB r genpept-bctl] 
[DErBacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT:alternate gene name: ypbA] [SP:P50727] [LE:13703] [RE:13951] [DI:direct] 
>gp: [GI:gll46198] [LN:BACSERA] [AC:L47648] [PN : ferredoxin] [GN:fer] [OR:Bacillus 
subtilis] [DB:genpept-bct2] [DE:Bacillus subtilis phosphoglycerate dehydrogenase 
(serA) , ypaA, ferredoxin (fer), ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glutamate 
dehydrogenase (ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, 
ypfB, cytidine monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphC, NAD+ 
dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes , complete cds.] 
[NT : ypbA ; similar to B . stearothermophilus ferredoxin;] [LE:2726] [RE:2974] 
[DI : complement] 
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ORF Name 



NTID 



NT 



AA 



AAID _ — . , , — 



LengthLength 



score 



|A17j>03000962 21667676 ci 975 



probability 
"| |287 | [4059 ||888 | |295 | |b90 | |2.2e-b7 



Description 

pir : [LN:C69981] [AC:C69981 ] [PN : conserved hypothetical protein yrvM] [GNryrvM ] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell84002 : g2635218] [LN : BSUB0015] 
[AC:Z99118 :AL009126] [GN : yrvM] [FNrunknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DEiBacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[NT: similar to hypothetical proteins] [LE: 17779] [RE: 18264] [DI : complement] 



ORF Name 



|AI750300u982 21S740S7 EI 8 



Description 
NO-HIT 



NT AA 
NTID AAID — . — t , score 
LengthLength 



probability 



] |288 | [ 4060 || 6S | 



ORF Name 



E 



17503000982 21678187 c3 1486 



Description 
NO-HIT 



NTID 



_ _ _ _ NT AA 

AAID — — , „ score 

LengthLength 

[4061 [[240 | [79 | 



probability 



NT AA 

ORF Name NTID AAID — , — , score probability 

LengthLength 



[AI750300u982_21730443_ci_i05£ [ [290 j [ 4062 ||TT?T"| | 443 "[[ 904 [| i.2e-90 [ 

Description 

sp: [LN:ODB2_BACSU] [AC:P37942] [GN: BFMBB : BFMB2 : BFMB] [OR:BACILLUS SUBTILIS] 
[EC:2.3.1.-] [DE : CHAIN TRANSACYLASE) ] [SP:P37942] [DB : swissprot] >pir: [LN:S32488] 
[AC:S32488 :E69593 ] [PN : di hydro lipoamide S-acyltransf erase, , alpha-oxo acid 
dehydrogenase complex (bfmBB) : branched- chain alpha-oxo acid dehydrogenase complex E2 
component : dihydrolipoyl acyltransf erase] [GN: bfmBB ] [CL :dihydrolipoamide 
acetyltransf erase: lipoyl/biot in-binding homology] [OR: Bacillus subtilis] [EC: 2. 3.1.-] 
[DB:pir2] >gp: [GI:gl42613] [LN: BACBRANCH] [AC : M97391 : M96 937] [PN:branched chain 
alpha-keto acid dehydrogenase E2] [OR: Bacillus subtilis] [SR: Bacillus subtilis DNA]' 
[DB: genpept-bctl] [DE:Bacillus subtilis branched chain alpha-keto acid 
dehydrogenases 1 -alpha, branched chain alpha-keto acid dehydrogenase El-beta, 
andbranched chain alpha-keto acid dehydrogenase E2 , complete cds . ] [LE:2228] [RE:3502] 
[DI: direct] >gp: [GI : dl0132 79 :gl303944] [LN : BACJH642] [AC :D84432 :D82370] [PN: BfmBB] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:221574] [RE:222848] [DI: direct] >gp : [GI :ell85671:g2 634837] [LN : BSUB0013 ] 
[AC:Z99116 :AL009126] [PN : branched -chain alpha-keto acid dehydrogenase E2] [GN: bfmBB] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [EC:2.3.1.-] [DE:Bacillus subtilis complete 
genome (section 13 of 21): from 2395261to 2613730.] [NT : alternate gene name: bfmB2] 
[SP:P37942] [LE:100771] [RE:102045] [DI : complement] 
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NT AA 

ORFJTame NTID AAID Le - thLe ~ th score probability 



AI7S030009fl2_217;>6S fc2_c2_12fl9 | |291 | [ 4063 | | 596 1 | 231 || 162 | | B.ie- T7 



Description 

pir: [LN:G69828] [AC:G69828 ] [PN : calcium-binding protein homolog yheG] [GNryheG ] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell82974 :g2633308] [LN : BSUB0006] 
[AC:Z99109:AL009126] [GNryheG] [FNrunknown] [ORrBacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21) : from 999501 tol209940.] 
[NT: similar to calcium-binding protein] [LE: 49122] [RE: 49742] [DI : complement] 

>gp: [GI:e325187:g2226164] [LN : BSY14080] [AC:Y14080] [PN : hypothetical protein] [GN:yheG] 
[ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis chromosomal DNA, region 

75 degrees: sspB upstreamof glyB.] [NT : similarity to bovine flavin reductase (PID =] 
[LE: 11988] [RE:12608] [DI:direct] 

NT AA 

ORF Name NTID AAID — — _ score probability 

LengthLength c L - 



Al7b030009a2_2i759427_c3_i407 | |292 | [ 4054 |p5u~| | 449 |p^"| p.7e-143 



Description 

sp: [LN:GCS1_BACSU] [AC:P54376] [GN : YQHJ] [OR:BACILLUS SUBTILIS] [EC:1.4.4.2] 
[DE : PROTEIN) ] [SP:P54376] [DB : swissprot] >pir : [LN : A69959] [AC:A69959 ] [PN:glycine 
dehydrogenase homolog yqhJ] [GN:yqhJ ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl013226:gl303891] [LN : BACJH642] [AC : D84432 : D82370] [PN:YqhJ] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 171390] 
[RE:172736] [DI:direct] >gp : [GI :ell85724:g2 634890] [LN: BSUB0013] [AC : Z99116 : AL009126] 
[GN:yqhJ] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 13 of 21) : from 2395261to 2613730.] [NT: similar to glycine 
dehydrogenase] [SP:P54376] [LE: 150885] [RE: 152231] [DI : complement] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



^17503000982 21914057 c3 1512 1 1 14065 II14I6 1 1571 



] |293 | [4055 |p^T5-] |471 | p2~ | |3 . ie-51 



Description 

gp: [GI:g2182835] [LN : LLU81166] [AC:U81166] [PN : hist idine kinase LlkinA] [GN: llkinA] 
[OR:Lactococcus lactis subsp . cremoris] [DB :genpept-bct2] [DE : Lactococcus lactis subsp. 

cremoris MG1363 histidine kinase (llkinA) gene , complete cds . ] [LE:1] [RE:1473] 
[DI: direct] 



NT AA 

ORF Name NTID AAID — _ Ut — , , score probability 

Length Length c • L - 



[AI7503 000982^21953 07_c3_1402 | |294 | |4065 |pU^T] pg4 ~|| 348 | | 9.9e-32 

Description 

sp: [LN:CMG2_BACSU] [AC:P25954] [GN : COMGB : COMG2] [OR:BACILLUS SUBTILIS] [DE : COMG OPERON 
PROTEIN 2] [SP:P25954] [DB : swissprot] >pir : [LN: C30338] [AC : C30338 : C69603 ] [PN:DNA 
transport machinery protein comGB:comG operon protein 2] [GN: comGB ] [OR: Bacillus 
subtilis] [DB:pir2] >gp : [GI :gl42707] [LN: BACCOMGA] [AC : M2 9691 :M22854] [GN:comG2] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain 168) (clone: pED4) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis (clone pED4) comG- (1 , 2 , 3 , 4 , 5 , 6 , and 7) proteins 
incomG operon, complete cds.] [LE:2420] [RE: 3391] [DI:direct] 

>gp: [GI:dl013213:gl303878] [LN : BACJH642] [AC : D84432 : D82370] [PN:ComGB] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain :JH642 (trpC2 PheAl)) DNA] [DB :genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 160650] 
[RE:161621] [DI:direct] >gp : [GI : ell85 74 0:g2 634906] [LN : BSUB0013] [AC : Z99116 : AL009126] 
[PN:probably part of the DNA transport machinery] [GN : comGB] [FN: competence] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [SP:P25954] [LE:162000] [RE:162971] [DI : complement] 
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ORF Name 



NT ID 



NT 



AA 



AAID _ L _ 



LenqthLength 



score 



probability 



AI7503000982_21962762_clJ.i39 | [295 | [4057 ||732-j|243 | p5~-||b . Oe-60 



Description 

gp: [GI :gl43267] [ LN : B ACODHAB ] [AC:M27141] [ORrBacillus subtilis] [SR : B . subtilis (strain 
3G18) DNA] [DB:genpept-bctl] [DE :B . subtilis 2 -oxoglutarate dehydrogenase (odhA) gene 3» 
end, anddihydrolipoamide transsuccinylase (odhB) gene, complete cds . ] 
[NT: 2 -oxoglutarate dehydrogenase (odhA; EC 1.2.4.2)] [LE:<1] [RE:883] [DIrdirect] 



ORF Name 



NTID 



AAID , . 



^7503000982 22031307 t3 906 



NT AA 
LengthLength 

[4068 ||iS0 1 1 49 [ 



score probability 



Description 
NO-HIT 

ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7&03000982 2227312 cl 974 



it 



|4069 |p7?T]|593 | [2019 ||8 . 4e-209 



Description 

sp: [LN:SYD_BACSU] [AC:O32038] [GN:ASPS] [OR:BACILLUS SUBTILIS] [EC : 6 . 1 . 1 . 12] 
[DE:(ASPRS)] [SP:O32038] [DB : swissprot] >pir: [LN:D69591] [AC:D69591 ] 

[PN: aspartate- -tRNA ligase, aspS : aspartyl- tRNA synthetase] [GN:aspS ] [CL : lysine- -tRNA 
ligase] [OR:Bacillus subtilis] [EC : 6 . 1 . 1 . 12] [DB:pir2] >gp : [GI : ell84003 : g2635219] 
[LN:BSUB0015] [AC : Z99118 : AL009126] [PN : aspartyl- tRNA synthetase] [GN:aspS] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [EC : 6 . 1 . 1 . 12] [DE: Bacillus subtilis complete genome 
(section 15 of 21) : from 2795131to 3013540.] [SP:O32038] [LE:18878] [RE:20656] 
[DI : complement] 



ORF Name 



NTID 



^ m NT AA 

AAID — , — score 

LengthLength 



probability 



IAI7503000982 22459462 c2 1275 



|298 | [4070 ||138 1 145 | 



Description 
NO-HIT 



ORF Name 



NTID 



„ TO NT AA 

AAID — _ — , score 

Length Length 



probability 



IAI7503000982 22661088 c3 1363 



j |299 | [4071 |[T5u— 1|49 | 



Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID — _ , — , , score 
L en gth Length 



probability 



A17503000982_23392_±2_366 

Description 

NO-HIT 
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ORF Name 



NT AA 
NT ID AAID — -« — A , score 
Length Length 



probability 



IAI75030009U2 2J44b2^b cl 1120 



"] ppl | [ 4073 | |557 |p?~| p.3e-S5 



Description 

sp: [LN:PMSR_BACSU] [AC:P54154] [GN:YPPP] [OR:BACILLUS SUBTILIS] [DE : REDUCTASE) ] 
[SP:P54154] [DB : swissprot] >pir : [LN: E69940] [AC:E69940 ] [PN: peptide methionine 
sulfoxide reductase homolog yppP] [GN:yppP ] [CL: peptide methionine sulfoxide 
reductase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl256653] [LN : BACYACA] [AC:L77246] 
[PN:DNA-binding protein] [GN:yppP] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DErBacillus subtilis (YAC10-9 clone) DNA region between the serA andkdg loci.] 
[NT: 42. 4% identity with the Lycopersicon esculentum] [LE: 33389] [RE: 33922] [DI: direct] 
>gp: [GI:ell83616:g2634589] [LN : BSUB0012] [AC : Z99115 : AL009126 ] [GN:yppP] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
12 of 21): from 2195541to 2409220.] [NT:similar to peptide methionine sulfoxide 
reductase] [SP:P54154] [LE:91230] [RE:91763] [D I : complement] 



ORF Name 



NTID 



a7vtt ^ NT AA 

AAID — t — , . score 

LengthLength 



probability 



IAI7S03000932 23445762 c3 1557 



Description 
NO-HIT 



^ |302 | |4074 ||144 | |47 [ 



ORF Name 



NTID 



NT 

AAID . 



AA 



LengthLength 



score probability 



7B03000982 23468938 ti 238 



TOT" 



Description 
NO-HIT 



EEZD 



[TUTS I 



ITS" 



ORF Name 



E 



17503000982 23470327 c2 1225 



NTID 



ruT" 



AAID 



NT 



AA 



LengthLength 



score 



probability 
■j [4076 |p^T] |48g |p5~ | |4.1e-42 



Description 

pir : [LN:S52351] [AC:S52351 ] [PN : hypothetical protein 1] [OR : Staphylococcus xylosus] 
[DB:pir2] >gp: [GI :g666115] [LN: SXGKG2] [AC:X84332] [GN:ugl] [OR : Staphylococcus xylosus] 
[DB: genpept-bctl] [DE : S . xylosus glucose kinase gene.] [NT:orfl upstream of glucose 

kinase] [LE:<1] [RE: 406] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength ' 



probability 



503000982 2 ^ 4704S2 c2 lm 



] |305 [ [4077 |[¥Tr~||i45 ||214 ||1.6e-17 



Description 

pir : [LN:F71860] [AC:F71860 ] [PNrbiotin carboxyl carrier protein] [GN: accB ] [CL:biotin 
carboxyl carrier protein: lipoyl/biotin-binding homology] [OR : Helicobacter pylori] 
[SR:strain J99, , strain J99] [SR:strain J99, ] [DB:pir2] >gp : [GI :g4155592] 
[LN:AE001529] [AC : AE001529 : AE00143 9] [PN:BIOTIN CARBOXYL CARRIER PROTEIN] [GN : accB] 
[OR Helicobacter pylori J99] [DB : genpept-bct2] [DE Helicobacter pylori, strain J99 
section 90 of 132 of the completegenome . ] [NT: similar to H. pylori 26695 gene HP0371] 
[LE:5447] [RE: 5932] [DI:direct] 



NT AA 

ORF Name NTID AAID — _ T — , . score probability 

Length Length 

|AI7B03000982_2347667fc_c2J.204 [ |306 | [4078 |122 | | 227 ||6.6e-19 

Description 

gp: [GI:g902055] [LN:BSU29177] [AC:U29177] [PN : diacylglycerol kinase] [GN:dgk] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis PhoH (phoH) gene, 

partial cds, diacylglycerolkinase (dgk) gene, complete cds, and cytidine deaminase 
(cdd) gene, partial cds.] [LE:2793] [RE:3197] [DI:direct] 
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ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000982 2359S13V c2 1185 



] P 07 1 1 4079 IP" 8 "!! 185 IP 2 I f 



probability 
3.6e-50 



Description 

sp: [LN:YQEG_BACSU] [AC:P54452] [GN: YQEG] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 20.1 
KD PROTEIN IN NUCB-AROD INTERGENIC REGION] [SP:P54452] [DB : swissprot] >pir : [LN: C69951] 
[AC:C69951 ] [PN : conserved hypothetical protein yqeG] [GNryqeG ] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl013122 : gl303787] [LN: BACJH642] [AC : D84432 : D823 70] [PNrYqeG] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] 
[DB: genpept-bctl] [DErBacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:72502] [RE:73020] [DI: direct] >gp : [GI : ell83 798 : g263 5014 ] [LN : BSUB0014 ] 
[AC:Z99117:AL009126] [GN:yqeG] [FNrunknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DErBacillus subtilis complete genome (section 14 of 21): from 2599451to 2812870.] 
[NT: similar to hypothetical proteins] [SP:P54452] [LE: 46410] [RE: 46928] [DI : complement] 



ORF Name 



NTID 



E 



17503000982 23597252 £2 557 



Description 
NO-HIT 



ORF Name 



AI7503000982 23620205 12 640 



Description 
NO-HIT 



n7VTT ^ NT AA 

AAID — — , score 

LengthLength 



probability 



|4080 |5S 



NT AA 
NTID AAID — _ — _ score 
LengthLength 



probability 



] I 309 1 | 4081 || 158 || 4S ~| 



ORF Name 



IAI7503000982 23S25000 c2 1245 



NTID 



pur 



NT 

AAID 



AA 



LengthLength 



score 



[4082 |pTFT| 



probability 
7T7T]f2.0e-24S 



Description 

pir: [LN:S44188] [AC:S44188 ] [PN : alpha-glucosidase , ] 

[CL : alpha -glucos idase : alpha -amylase core homology] [OR : Staphylococcus xylosus] 
[EC:3.2.1.20] [DB:pir2] >gp : [GI : g474177] [LN : SXMALRAG] [AC:X78853] 

[PN:alpha-D-l, 4-glucosidase] [GNrmalA] [OR : Staphylococcus xylosus] [DB : genpept-bctl] 
[EC:3 .2.1.20] [DE : S . xylosus malR gene and malA gene.] [LE:1454] [RE:3103] [DI:direct] 



ORF Name 



[AI7503000982 ^3525383 t3 761 



Description 
NO-HIT 



XTfnTr , „ NT AA 

NTID AAID — _ — _ score 

LengthLength 



probability 



~j |311 | [4083 ||132 | [43 | 



ORF Name 



117503000982 2352642b cl 1042 



NTID 



„ T „ NT AA 

AAID — — , _ score 

LengthLength 



][ 



TIT" 



|4084 [P^^] p^T" 



TUT 



probability 
1.0e-75 



Description 

pir: [LN:S72490] [AC : S72490 : 139765 ] [PN:N- acetyl -gamma -glutamyl -phosphate reductase,] 
[GN:argC ] [CL :N- acetyl -gamma -glutamyl -phosphate reductase] [OR: Bacillus 
stearothermophilus] [EC : 1 . 2 . 1 . 38] [DB :pir2] 
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NT AA 

ORFName NTID AAI2 Le ~ thLe ~ th gcore probability 



Al7503000982J>JM^942_ci_1012 | | 313 | [4085 ||696 | |231 |pn~ |[T~ge 



Description 

pir : [LN:B71609] [AC:B71609 ] [PN: hypothetical protein PFB0680w] [GN:PFB0680w ] 
[OR : Plasmodium falciparum] [DB:pir2] >gp : [GI :g3845248] [LN: AE001410] 
[AC:AE001410:AE001362] [PN : hypothetical protein] [GN : PFB0680w] [OR : Plasmodium 
falciparum] [SRrmalaria parasite P. falciparum] [DB : genpep t - inv2 ] [DE : Plasmodium 
falciparum chromosome 2, section 47 of 73 of thecomplete sequence.] [NT : predicted by 
GlimmerM] [LE : 10507 : 10754 : 12646] [RE : 10567 : 12528 : 12807] [DI : direct Join] 



NT AA 

ORF Name NTID AAID — _ T — j , score probability 
LengthLength c 



[AI7503000982 23703452 ci 1037 1 1 [4086 |[TFuT]pT — IfTEOT 



i.9e-186 



Description 

sp: [LN:GCS2_BACSU] [AC:P54377] [GN: YQHK] [OR : BACILLUS SUBTILIS] [EC:1.4.4.2] 
[DE : PROTEIN) ] [SP:P54377] [DB : swissprot] >pir : [LN: B69959] [AC:B69959 ] [PN:glycine 
dehydrogenase homolog yqhK] [GN:yqhK ] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl013227:gl303892] [LN:BACJH642] [AC : D84432 : D82370] [PN : YqhK] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 172729] 
[RE:174195] [DI:direct] >gp : [GI :ell85723 :g2 634889] [LN: BSUB0013] [AC : Z99116 : AL009126] 
[GN:yqhK] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 13 of 21): from 2395261to 2613730.] [NT:similar to glycine 
dehydrogenase] [SP:P54377] [LE: 149426] [RE: 150892] [DI : complement] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000982_23707890_c2_1322 | |315 | pFT^] pu~| |7 9 | pi~ || *> . 9e-08 "| 



Description 

pir: [LN:A69931] [AC:A69931 ] [PN : hypothetical protein yozE] [GN: yozE ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell8543 9 :g2 634360] [LN : BSUB0011] [AC : Z99114 : AL009126] 
[GN :yozE] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 11 of 21): from 2000171to 2207900.] [LE:137942] [RE:138166] 
[DI complement] 



NT AA 

ORF Name NTID AAID — * — _ score probability 

LengthLength £ - 

|AI7503000982_2 38i88b__ci_988 | |315 | [4088 | | 127 J] |423 | [1066 || 8 . le-108 



Description 

pir : [LN:E69765] [AC : E69765 : 139896 : 139895 ] [PN:branched chain amino acids transporter 
homolog ycsG] [GN :ycsG ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82373 :g2632707] 
[LN:BSUB0003] [AC : Z99106 : AL009126] [GN : ycsG] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 3 of 21) : from 402751 
to611850.] [NT: alternate gene name: ycsH; similar to branched] [LE: 54666] [RE: 55826] 
[DI : direct] 



151 



NT AA 

ORFJTame NTID AAID Le ~ thLe ~ th gcore probability 



A17503000982_2JaJb0S2J:2_497 | pi 7 | [4089 | | B4S 1 1 282 || J92 p.le-T T 



Description 

sp: [LN:PROI_BACSU] [AC:P54552] [GN : YQJO] [OR : BACILLUS SUBTILIS] 
[DE : PYRROL INE - 5 - CARBOXYLATE REDUCTASE HOMOLOG 2] [SP:P54552] [DB : swissprot] 
>pir: [LN:G69964] [AC:G69964 ] [PN : pyrrol ine- 5 -carboxylate reductase homolog yqjO] 
[GNryqjO ] [CL : pyrrol ine- 5 -carboxylate reductase] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl013301:gl303966] [LN: BACJH642] [AC : D84432 : D82370] [PN:YqjO] [OR:Bacillus 
subtilis] [SRrBacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] 
[DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 245663] 
[RE:246499] [DI : complement] >gp: [GI : ell85649 : g2634815] [LN : BSUB0013] 

[AC:Z99116 :AL009126] [GN:yqjO] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT: similar to pyrroline- 5 -carboxylate reductase] [SP:P54552] [LE: 77120] [RE: 77956] 
[DI: direct] 



NT AA 

ORF Name NTID AAID — _ „ — _ score probability 
LengthLength u 



AI7603000382_238S0302_c3_i3g9 | |3i8 | [ 4090 |p^¥] |897 | pp95 || i . 9e-2 ^T 



Description 

sp: [LN:SYA_BACSU] [AC:034526] [GN: ALAS] [OR:BACILLUS SUBTILIS] [EC:6.1.1.7] 
[DE : ALANYL -TRNA SYNTHETASE, (ALANINE- -TRNA LIGASE) (ALARS) ] [SP:034526] [DB : swissprot] 
>pir: [LN:A69584] [AC:A69584 ] [PN : alanine- - tRNA ligase, alaS : alanyl- tRNA synthetase] 
[GN:alaS ] [CL : alanine- -tRNA ligase] [OR:Bacillus subtilis] [EC:6.1.1.7] [DB:pir2] 
>gp: [GI:ell83970:g2635186] [LN : BSUB0014] [AC : Z99117 : AL009126] [PN : alanyl - tRNA 
synthetase] [GN:alaS] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:6.1.1.7] 
[DE:Bacillus subtilis complete genome (section 14 of 21): from 2599451to 2812870.] 
[SP:034526] [LE:197994] [RE:200630] [DI : complement] >gp: [GI : ell83988 :g2635204] 
[LN:BSUB0015] [AC : Z99118 : AL009126] [PN : alanyl -tRNA synthetase] [GN:alaS] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [EC:6.1.1.7] [DE:Bacillus subtilis complete genome (section 
15 of 21): from 2795131to 3013540.] [SP:034526] [LE:2314] [RE:4950] [DI : complement] 

NT AA 

ORF Name NTID AAID — _„ — , , score proba bility 
LengthLength c 



AlVS03000982_23884692_c3_i416 | |319 | [4091 | 121 | p77~|| 7 . 5e-18 



Description 

sp : [LN : YQHY_BACSU] [AC:P54519] [GN : YQHY] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 14.7 
KD PROTEIN IN ACCC-FOLD INTERGENIC REGION] [SP:P54519] [DB : swissprot] >pir : [LN : E6996 0] 
[AC:E69960 ] [PN: conserved hypothetical protein yqhY] [GN:yqhY ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl 013249 :gl 303914] [LN : BACJH642] [AC :D84432 :D82370] [PN : YqhY] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain: JH642 (trpC2 PheAl)) DNA] 
[DB: genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:189273] [RE:189680] [DI:direct] >gp : [GI : ell8 5701 : g2634867] [LN : BSUB0013 ] 
[AC:Z99116:AL009126] [GN:yqhY] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT:similar to hypothetical proteins from B. subtilis] [SP:P54519] [LE:133941] 
[RE : 134348] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ — , score probability 
- LengthLength 



AI7g03000982_2:i9i2502_r2_605 | | 320 | [4092 | [T?B~| | 45 | 



Description 
NO-HIT 



152 



ORF Name 



NT AA 
NTID AAID — .« _ — . , score 
LengthLength 



probability 



117503000982 2J964011 12 352 



"j | 321 | [4093 1)1913 — | jfe4 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



^17503000982 24042212 C3 1403 



TIT 



[4094 |[^~| [155 | [72 ||0.034~ 



Description 

sp: [LN:CMG3_BACSU] [AC:P25955] [GN : COMGC : COMG3 ] [OR:BACILLUS SUBTILIS] [DE : COMG OPERON 
PROTEIN 3 PRECURSOR] [SP:P25955] [DB : swissprot] >pir: [LN:D30338] 

[AC :D30338 : A3 513 3 : D69603 ] [PN : exogenous DNA-binding protein comGC:comG operon protein 
3] [GNiComGC ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl42708] [LN : BACCOMGA] 
[AC:M29691 :M22854] [GN:comG3] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain 168) 
(clone: pED4) DNA] [DB :genpept-bctl] [DE: Bacillus subtilis (clone pED4) 
comG- (1 , 2 , 3 , 4 , 5 , 6, and 7) proteins incomG operon, complete cds . ] [LE:3405] [RE:3701] 
[DIrdirect] >gp : [GI : dl 013 2 14 : gl303879] [LN : BACJH642] [AC : D84432 : D823 70] [PN:ComGC] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] 
[DB :genpept-bctl] [DErBacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:161635] [RE:161931] [DI:direct] >gp : [GI : ell85739 :g2634905] [LN : BSUB0013] 
[AC:Z99116 :AL009126] [GN:comGC] [FN : exogenous DNA-binding (competence)] [OR:Bacillus 
subtilis] [DB :genpept-bctl] [DErBacillus subtilis complete genome (section 13 of 21): 
from 2395261to 2613730.] [SP:P25955] [LE:161690] [RE:161986] [DI : complement] 



ORF Name 



NTID 



NT 

AAID „ 



AA 



LengthLength 



LL7b03000982 24071069 tl 186 



TIT 



[4095 |[?^~[|i32 [pre 



score probability 
1.7e-27 



Description 

sp: [LN:YQHL_BACSU] [AC:P54510] [GN : YQHL] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 14.6 
KD PROTEIN IN GCVT-SPOIIIAA INTERGENIC REGION] [SP:P54510] [DB : swissprot ] 
>pir: [LN:C69959] [AC:C69959 ] [PN:glpE protein homolog yqhL] [GN:yqhL ] [CL:glpE 
protein] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013228 :gl303893] [LN : BACJH642] 
[AC:D84432 :D82370] [PN : YqhL] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl)) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 174230] [RE: 174610] [DI : complement] 
>gp: [GI:ell85722 :g2634888] [LN: BSUB0013] [AC : Z99116 : AL009126] [GNryqhL] [FN:unknown] 
[ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [NT:similar to hypothetical proteins] [SP:P54510] 
[LE : 149011] [RE : 1493 91] [DI : direct] 



ORF Name 


NTID 


AAID 


NT AA 
, — . , . — , , score probability 
LengthLength c -L 


|AI7503000982_24094090_ci__986 


324 


4096 


|957 318 ||927 ||4.4e-93 | 



Description 

pir : [LN:G69979] [AC:G69979 ] [PN : proteinase homolog yrrN] [GN:yrrN ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell83965:g2 635181] [LN : BSUB0014] [AC : Z99117 : AL009126] 
[GN:yrrN] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 14 of 21): from 2599451to 2812870.] [NT:similar to protease] 
[LE : 193967] [RE : 194896] [DI : complement] 
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NT AA 

ORFName NTID AAID ^ th ^— th score probability 



AI7503000982_2410071b_c3_1422 | [ 325 | [ 4097 | pyT| [ 344 [ pW]| b . 9e-89 



Description 

sp: [LN:ODBAJBACSU] [AC:P37940] [GN : BFMBAA : BFMB1A] [OR:BACILLUS SUBTILIS] [EC:1.2.4.4] 
[DE: (BCKDH El -ALPHA) ] [SP:P37940] [DB : swissprot] >pir : [LN : C6 9593 ] [AC : C69593 : S32486 ] 
[PN: 3-methyl-2-oxobutanoate dehydrogenase (lipoamide) , El alpha chain 
bfmBAA: branched- chain alpha-oxo acid dehydrogenase El alpha chain] [GN : bf mBAA ] 
[CL:pyruvate dehydrogenase (lipoamide) alpha chain: thiamine pyrophosphate -binding 
domain homology] [OR:Bacillus subtilis] [EC: 1.2.4.4] [DB:pir2] >gp : [GI :gl42611] 
[LN : BACBRANCH] [AC : M97391 : M9693 7] [PN:branched chain alpha-keto acid dehydrogenase] 
[OR: Bacillus subtilis] [SR: Bacillus subtilis DNA] [DB : genpept-bctl] [DE: Bacillus 
subtilis branched chain alpha-keto acid dehydrogenaseEl - alpha , branched chain 
alpha-keto acid dehydrogenase El-beta, andbranched chain alpha-keto acid dehydrogenase 
E2, complete cds . ] [LE:216] [RE:1208] [DIrdirect] >gp : [GI : dl013277 : gl303942] 
[LN:BACJH642] [AC : D84432 : D823 70] [PN : BfmBAA] [OR:Bacillus subtilis] [SR:Bacillus 
subtilis (strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 
283 Kb region containing skin element.] [LE: 219562] [RE: 220554] [DI: direct] 
>gp: [GI : el 18 5673 :g2 634839] [LN : BSUB0013 ] [AC : Z99116 : AL009126] [PN : branched- chain 
alpha-keto acid dehydrogenase El] [GN: bfmBAA] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[EC:1.2.4.4] [DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 
2613730.] [NT:alternate gene name: bfmBla] [SP:P37940] [LE: 103065] [RE: 104057] 
[DI : complement] 

NT AA 

ORF Name NTID AAID — — , score probability 
LengthLength c 

A17503000982_24104702_ci_1109 | [325 | [4098 |p^T] [ 10 , 20 | [ 618 ||i.le-b4 — [ 



Description 

gp: [GI:gl041785] [LN : PYU3692 7] [AC:U36927] [PN:rhoptry protein] [FN: erythrocyte 
invasion and possible binding] [OR : Plasmodium yoelii] [DB : genpept - invl ] [DE : Plasmodium 
yoelii rhoptry protein gene, partial cds.] [LE:<1] [RE: 72 06] [DI: direct] 



NT AA 

ORF Name NTID AAID — — , , score pro bability 
LengthLength *- 



AI7503 000982_24117777_t3_858 [ [327 [ [4099 [[T75 [ [57 [ 



Description 
NO-HIT 



ORF Name 



NTID 



NT 

AAID 



AA 



LengthLength 



score probability 



1AI7503000982 24225000 cl 1152 



T21T 



] [4100 ||55TT~ 



1.6e-33 



Description 

gp: [GI:g2194195] [LN: SGU61158] [AC:U61158] [PN:GdmF] [GN:gdmF] [OR : Staphylococcus 
gallinarum] [DB : genpept-bctl] [DE : Staphylococcus gallinarum Tue3928 GdmF (gdmF), 
putative membraneprotein (gdmH) , ABC transporter (gdmT) , and antibiotic 
galliderminprecursor (gdmA) genes, complete cds, putative membrane protein (gdmE) and 
modifying enzyme (gdmB) genes, partial cds.] [NT:proposed ABC transporter subunit 
(ATP-binding] [LE:179] [RE: 874] [DI : complement] 
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NT AA 

ORF Name NTID AAID Le ^ t h Length score probability 



[Al75030009^2_24226412_c3_14^6 | | 329 | [ 4101 | | 5 7 0 | | 189 | [gT7~|| i . 7e-4i 

Description 

sp: [LN:YQKGJBACSU] [AC:P54570] [GN : YQKG] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 21.0 
KD PROTEIN IN GLNQ-ANSR INTERGENIC REGION] [SP:P54570] [DB : swissprot] >pir : [LN : A6 9967] 
[AC:A69967 ] [PN : conserved hypothetical protein yqkG] [GN:yqkG ] [CLiyffH protein :mutT 
domain homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013319 : gl303984] 
[LN:BACJH642] [AC :D84432 : D82370] [PN : YqkG] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DErBacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 260584] [RE: 261141] [DI:direct] 

>gp: [GI:ell85630:g2634796] [LN:BSUB0013] [AC : Z99116 : AL009126] [GN:yqkG] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [NT:similar to hypothetical proteins] [SP:P54570] 
[LE : 62478] [RE : 63035] [DI : complement] 



NT AA 

ORF Name NTID AAID — — __ score probability 
LengthLength c JL 

|&I7S03000582_2422663SJ:2_4S9 | |330 | [4102 | jI55~| |Si | 

Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ T — score probability 
LengthLength 



A?7503000982_24228452^r3_942 | [331 | [4103 | p5~ | | 4S | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ „. — , , score proba bility 
LengthLength L - 



AI75030u0982_242255i5_c3_i423 | |332 | [4104 |pT7u"] p89 | pu~|| l . ie-38 



Description 

sp : [LN : YQ JE_BACSU] [AC:P54542] [GN : YQJE] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 39.7 
KD PROTEIN IN GLNQ-ANSR INTERGENIC REGION] [SP:P54542] [DB : swissprot] >pir : [LN : E6 9963 ] 
[AC:E69963 ] [PN : tripeptidase homolog yqjE] [GN:yqjE ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl013291:gl303956] [LN : BACJH642] [AC :D84432 :D82370] [PN: Yqj E] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl)) DNA] [DB : genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE:232856] 
[RE:233971] [DI: direct] >gp: [GI :ell85659:g2 634825] [LN : BSUB0013] [AC : Z99116 : AL009126] 
[GN:yqjE] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 13 of 21) : from 2395261to 2613730.] [NT: similar to 
tripeptidase] [SP:P54542] [LE: 89648] [RE: 90763] [DI : complement ] 

NT AA 

ORF Name NTID AAI D — , — , score probability 
LengthLength 



A!T7503000982_24229805_c3_i375 | |333 | [4105 |p55-| | 46i | pTIT] 



6.4e-ilS 



Description 

pir: [LN:A69581] [AC:A69581 ] [PN : acetyl -CoA carboxylase (biotin carboxylase subunit) 
accC] [GN:accC ] [CL: biotin carboxylase : biotin carboxylase homology] [OR: Bacillus 
subtilis] [DB:pir2] >gp: [GI : dl 013 248 :gl 303913] [LN : BACJH642] [AC : D84432 : D82370] 
[PN:YqhX] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl)) DNA] 
[DB:genpept-bctl] [DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:187900] [RE:189252] [DI:direct] >gp : [GI :ell85702 :g2 634868] [LN: BSUB0013] 
[AC:Z99116 :AL009126] [PN : acetyl -CoA carboxylase subunit (biotin] [GN:accC] 
[FN: long- chain fatty acid biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[EC:6.4.1.2] [DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 
2613730.] [NT:alternate gene name: yqhX] [SP:P49787] [LE:134369] [RE:135721] 
[DI : complement] 
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ORF Name 



NT ID 



NT AA 
AAID — « _ — scor e 
LengthLength 



probability 



117503000982 24257658 FT 309 



Description 
NO-HIT 



] I 334 1 1 4106 ll U2 Hl b3 1 



ORF Name 



NT ID AAID 



p7503000982_2425005i_c3_i353 



T31T 



TTUT 



NT AA 
LengthLength 
RTF 



score 



2TT" 



probability 
i.5e-17 



Description 

sp: [LN:YSXB_BACSU] [AC: P26 942 : Q4562 9] [GN: YSXB] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
12.3 KD PROTEIN IN RPLU-RPMA INTERGENIC REGION (ORF X)] [SP : P26942 : Q45629] 
[DB:swissprot] >pir: [LN:S18440] [AC : S18440 : D21895 : B69987 ] [PN : conserved hypothetical 
protein ysxB] [GN: ysxB ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g40174] 
[LN:BSSPOIVFO] [AC : X59528 : S61796 ] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis spoIVFA, spoIVFB, L20, orfX and L24 genes.] [NT : ORF X] [SP:P26942] 
[LE:2270] [RE:2608] [DI:direct] >gp : [GI : ell84044 : g263 5260] [LN : BSUB0015] 
[AC : Z99118 : AL009126] [GN:ysxB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[NT: similar to hypothetical proteins] [SP:P26942] [LE: 59308] [RE: 59646] [DI : complement] 



ORF Name 



NT 



AA 



NT ID AAID — *^ — , , score probab ility 
LengthLength c z - 



1AI7503000982 24251592 c3 1518 



] I 336 I | 4108 |p^uT]|633 ||595 ||2.7e-100 



Description 

pir: [LN:D69907] [AC:D69907 ] [PN : hypothetical protein yojO] [GN:yojO ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell85410:g2 634331] [LN: BSUB0011] [AC : Z99114 : AL009126] 
[GN:yojO] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 11 of 21): from 2000171to 2207900.] [LE:110914] [RE:112899] 
[DI complement] >gp : [GI :g3169331] [LN : AF026147] [AC : AF026147] [PN:YojO] [GN:yojO] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 2 ] [DE:Bacillus subtilis Yoj A 
(yojA) , YojB (yojB) , YojC (yojC) , YojD(yojD), YojE (yojE) , Yoj F (yojF), YojG (yojG) , 
YojH (yojH) , Yojl(yojl), YojJ (yoj J) , YojK (yojK) , YojL (yojL) , YojM (yojM) , 
YojN(yojN), and YojO (yojO) genes, complete cds; and OdhA (odhA) gene, partial cds . ] 
[LE: 12366] [RE: 14351] [DI: direct] 



ORF Name 



NT AA 
NT ID AAID — — , score 
LengthLength 



probability 



IAI7503000982 24255502 El 575 



Description 
NO-HIT 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score 



IA17503000982 24274192 t2 385 



][ 



[4110 |pi~||135 |pr 



probability 
1 . 5e-19 



Description 

sp : [LN : EBSB_ENTFA] [AC:P36921] [OR : ENTEROCOCCUS FAECALIS] [SR :, STREPTOCOCCUS FAECALIS] 
[DE : CELL WALL ENZYME EBSB] [SP:P36921] [DB : swissprot] >pir : [LN : B4 993 9] [AC:B49939 ] 
[PN:ebsB protein] [CL : Enterococcus faecalis ebsB protein] [OR : Enterococcus faecalis] 
[DB:pir2] >gp : [GI : g388108] [LN: ENEEBSA] [AC:L23802] [PN: cell wall enzyme] [GN : ebsB] 
[OR: Enterococcus faecalis] [SR: Enterococcus faecalis (strain OGlSSp) DNA] 
[DB : genpept -bct2] [DE : Enterococcus faecalis pore forming, cell wall enzyme, 
regulatory, and dehydroquinase homologue proteins (ebsA, ebsB, ebsC, and ebsD) genes, 
complete cds with repeat region.] [NT : putative] [LE:734] [RE: 1141] [DI:direct] 
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ORF Name 



NTID 



NT AA 
AAID — J _, T — _ score 
LengthLength 



probability 



1AI7503000982 2427b927 t'l 622 



][ 



[4111 |p^~| 



Description 
NO-HIT 



ORF Name 

|AI75030009a2 24^062(53 ci 1034 



NTID 



7v7vTr ^ NT AA 

AAID — , — , score 

LengthLength 



probability 



14112 



Description 
NO-HIT 



ORF Name 



NTID 



_ _ NT AA 

^ LengthLength 



[Al7b0i000992 24410300 c3 1439 



][ 



[4113 p^"" 



probability 
1.5e-67 



Description 

Sp : [LN : RLUB_BACSU] [AC:P35159] [GNrRLUB] [ORrBACILLUS SUBTILIS] [EC : 4 . 2 . 1 . 70] 
[DE: (PSEUDOURIDYLATE SYNTHASE) (URACIL HYDROLYASE) ] [SP:P35159] [DB : swissprot] 
>pir: [LN:S45555] [AC : S45555 : A69943 ] [PN : conserved hypothetical protein ypuL] [GN:ypuL 
] [CLrconserved hypothetical protein HI1243] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI :g410137] [LN : BACDIA] [AC:L09228] [ORrBacillus subtilis] [SR:Bacillus subtilis 
(strain 168, sub_species Marburg) DNA] [DB :genpept-bctl] [DErBacillus subtilis spoVA to 
serA region.] [NT:ORFX13] [LE:17440] [RE:18129] [DI:direct] >gp : [GI : ell85585 :g2634751] 
[LN:BSUB0013] [AC r Z99116 r AL009126] [GN:ypuL] [FNrunknown] [ORrBacillus subtilis] 
[DB :genpept-bctl] [DE: Bacillus subtilis complete genome (section 13 of 21) : from 
2395261to 2613730.] [NT:similar to hypothetical proteins] [SP:P35159] [LE:25454] 
[RE : 26143 ] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000982 24470317 ci 1011 



][ 



probability 
[4114 |[TuTu-|p39 ||i246 ||fe.ae-127 



Description 

sp : [LN : YQFA_BACSU] [AC:P54466] [GN : YQFA] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 35.6 
KD PROTEIN IN RPSU-PHOH INTEREGENIC REGION] [SP:P54466] [DB : swissprot] >pir : [LN : A69953 ] 
[AC:A69953 ] [PN : hypothetical protein yqfA] [GN:yqfA ] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:dl013152:gl303817] [LN: BACJH642] [AC :D84432 :D82370] [PN : YqfA] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain :JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 101161] 
[RE:102156] [DI:direct] >gp : [GI : el 183768 :g2 634984] [LN : BSUB0014] [AC : Z99117 : AL009126] 
[GN:yqfA] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 14 of 21) r from 2599451to 2812870.] [SPrP54466] [LErl7274] 
[RE r 18269] [DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID , , 

LengthLength 



score probability 



|A17503000^^2 244^0275 c3 1499 



] P 4 ^ | | 4115 — | |329 ||602 ||i.2e-5fl 



Description 

pir r [LNrA69653] [ACrA69653 ] [PN r transmembrane lipoprotein lplB] [GNrlplB ] 
[ORrBacillus subtilis] [DBrpir2] >gp r [GI r ell82690 rg2633024] [LNrBSUB0004] 
[ACrZ99107:AL009126] [PN: transmembrane lipoprotein] [GN:lplB] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DErBacillus subtilis complete genome (section 4 of 21) : from 600701 
to813890.] [LE:179900] [RE:180856] [DIrdirect] 
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NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 



AI 7 5030009 82_244a633 0_ci^980 | | 344 | [ 4116 ||593 1 1 230 | pi~ | | 7 : 2e-29 



Description 

pir: [LN:H69978] [AC:H69978 ] [PN : conserved hypothetical protein yrrB] [GN:yrrB ] 
[CL: tetratricopeptide repeat homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell83978:g2635194] [LN: BSUB0014] [AC : Z99117 : AL009126 ] [GNiyrrB] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
14 of 21): from 2599451to 2812870.] [NTrsimilar to hypothetical proteins] [LE:208527] 
[RE:209147] [DI : complement] >gp : [GI : ell83996 : g2635212] [LN : BSUB0015] 

[AC:Z99118 :AL009126] [GNryrrB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21) : from 2795131to 3013540.] 
[NT: similar to hypothetical proteins] [LE: 12847] [RE: 13467] [DI : complement] 



NT AA 

ORF Name NT I D AAID — , — , score probability 
LerigthLength JL 



AJ75030u0982_244890g2_ti_l | |345 | [4117 |fl?5~| |4i ||72 ||0 . 017 



Description 

pir: [LN:D70083] [AC:D70083 ] [PN : hypothetical protein yxzC] [GN:yxzC ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell84645:g2 636466] [LN:BSUB0021] [AC : Z99124 : AL009126] 
[GN:yxzC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 21 of 21): from 3999281to 4214814.] [LE:21405] [RE:21773] 
[DI : complement] 



NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 



AI750300u982_2451157g_cl_1140 | |345 | [4118 | 81 | | 251 || 4.3e-21 



Description 

sp: [LN:OD02_BACSU] [AC:P16263] [GN : ODHB : CITM] [OR:BACILLUS SUBTILIS] [EC : 2 . 3 . 1 . 61] 
[DE: DEHYDROGENASE COMPLEX, (E2)] [SP:P16263] [DB : swissprot] >pir : [LN : B32879] 
[AC:B32879 :F69668 ] [PN : dihydrol ipoamide S-succinyltransf erase , odhB : 2-oxoglutarate 
dehydrogenase complex E2 component odhB: dihydrol ipoamide transsuccinylase odhB] 
[GN : odhB ] [CL : dihydrol ipoamide acetyltransf erase : lipoyl/biotin-binding homology] 
[OR:Bacillus subtilis] [EC : 2 . 3 . 1 . 61] [DB:pir2] >gp : [GI : gl43268] [LN : BACODHAB] 
[AC:M27141] [OR:Bacillus subtilis] [SR : B . subtilis (strain 3G18) DNA] [DB : genpept-bctl] 
[DE:B. subtilis 2-oxoglutarate dehydrogenase (odhA) gene 3' end, anddihydrol ipoamide 
transsuccinylase (odhB) gene, complete cds . ] [NT : dihydrol ipoamide transsuccinylase 
(OdhB; EC] [LE:899] [RE:2152] [DI:direct] >gp : [GI : ell85408 :g2634329] [LN:BSUB0011] 
[AC:Z99114 :AL009126] [PN : 2-oxoglutarate dehydrogenase complex] [GN : odhB] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [EC : 2 . 3 . 1 . 61] [DErBacillus subtilis complete genome 
(section 11 of 21): from 2000171to 2207900.] [NT : alternate gene name: citM] [SP:P16263] 
[LE : 106590] [RE : 107843] [DI : complement] 



ORF Name 


NT AA 

NT ID AAID . — — . , score probability 
LengthLength c L - 


|AI7503000982_24510877_cl_1035 


| |347 | |4119 519 | |I72 ||226 8.4e-19 | 



Description 

sp : [LN : AROK_LACLA] [AC:P43906] [GN : AROK] [OR : LACTOCOCCUS LACTIS] 

[SR: , SUBSPLACTIS : STREPTOCOCCUS LACTIS] [EC : 2 . 7 . 1 . 71] [DE : SHIKIMATE KINASE, (SK) ] 
[SP:P43906] [DB : swissprot ] >pir : [LN : S52581] [AC:S52581 ] [PN : shikimate kinase , ] 
[CL:shikimate kinase : shikimate kinase homology] [OR : Lactococcus lactis] [EC : 2 . 7 . 1 . 71] 
[DB:pir2] >gp : [GI :g683584] [LN : LLTYRAPH] [AC:X78413] [PN: shikimate kinase] [GN: aroK] 
[OR: Lactococcus lactis] [DB : genpept-bctl] [EC : 2 . 7 . 1 . 71] [DE:L. lactis tyrA, aroA, aroK 
and pheA genes.] [SP:P43906] [LE:2605] [RE:3093] [DI:direct] 
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NT AA 

ORF Name NT ID AAID Le ^ t ^Length score probability 

|AI7503000982_246ib91b_cJJ.b22 | | 348 | [ 4120 || 240 | |7 9 ||275 || b.4e-24 

Description 

gp: [GI:g2226349] [LN: AF003593] [AC : AF003593] [PN:CspC] [GN:CspC] [OR : Staphylococcus 
aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus CspC (cspC) gene, complete cds . ] 
[NT:similar to major cold-shock protein] [LE:444] [RE:644] [DI:direct] 



ORF Name 



F 



17503000982 24517177 c3 1433 



Description 
NO-HIT 



ORF Name 



1M7503000982 24540937 c2 1229 



Description 
NO-HIT 



NT AA 
NTID AAID — - T — , , score 
LengthLength 

] P 4 ^ 1 1 4121 IP^HI 42 1 



probability 



NTID AAID score 
LengthLength 



probability 



ORF Name 



NTID 



IAI7503000982 24S47187 cl 1038 



TTZT 



NT AA 

AAID , , 

LengthLength 



score probability 
53 ||8.8e-05 



Description 

gp: [GI:gl022725] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 
[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT :ORF2] 
[LE:394] [RE:1083] [DI : complement] >gp : [GI :g295162] [LN : STAMECRA] [AC:L14017] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus (strain COL) DNA] 
[DB : genpept-bctl] [DE : Staphylococcus aureus methicillin-resistance protein (mecR) 
geneand unknown ORF, complete cds.] [NT:unknown ORF1; putative] [LE:1492] [RE:2181] 
[DI complement] 



ORF Name 



NTID 



AAID 



lAI7b03000982 24650016 c3 1478 



332" 



4124 



NT AA 
LengthLength 



T3TT 



score probability 
2.8e-ll 



TE3" 



Description 

sp: [LN:YPSB_BACSU] [AC:P50839] [GN : YPSB] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 11.6 
KD PROTEIN IN COTD-KDUD INTERGENIC REGION] [SP:P50839] [DB : swissprot] >pir : [LN : E69941] 
[AC:E69941 ] [PN : hypothetical protein ypsB] [GN: ypsB ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:gll46182] [LN : BACPONAYPP] [AC:L47838] [GN:ypsB] [FN: hypothetical] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [DE:Bacillus subtilis (clone YAC15-6B) ponA gene, yppBCDEFG 
genes, ypqAE genes, yprAB genes, cotD gene, ypsABC genes, rnaP gene, yptAgene, ypuA 
gene, kduDI genes, kdgRKAT genes, ypwA gene, completecds 1 s . ] [NT : putative] [LE: 12192] 
[RE:12488] [DI:direct] >gp : [GI :ell83664:g2 634637] [LN : BSUB0012 ] [AC : Z 99115 : AL009126] 
[GN:ypsB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 12 of 21): from 2195541to 2409220.] [SP:P50839] [LE:135481] 
[RE : 135777] [DI : complement] 



ORF Name 



NTID 



^17503000982 24650252 r3 731 



T5T 



TT2"5~ 



NT AA 

AAID 

LengthLength 



score 



TTTT 



probability 



Description 

gp: [GI:el357086:g3955030] [LN : SAU17795] [AC:Y17795] [PN:unknown] [GN:prf A] 
[OR staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus prfA, pbp2 
genes.] [LE:731] [RE: 1357] [DI:direct] 
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NT AA 

ORF Name NTID AAID Le ^ t fr Le ^ th score probability 



AI7503000982JM6b8bfe2J:3_926 | | 3S4 | [ 4125 | p^7~| |428 | [14 7 5 |p . 7e-151 



Description 

pir: [LN:D69981] [AC:D69981 ] [PN : conserved hypothetical protein yrvN] [GNryrvN ] 
[CL: Haemophilus influenzae conserved hypothetical protein HI1590] [OR: Bacillus 

subtilis] [DB:pir2] >gp : [GI : ell83983 :g2635199] [LN: BSUB0014] [AC : Z99117 : AL009126] 

[GN : yrvN] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis 

complete genome (section 14 of 21): from 2599451to 2812870.] [NT:similar to 

hypothetical proteins] [LE:212152] [RE:213417] [DI:direct] >gp : [GI : ell84001 :g2635217] 
[LN:BSUB0015] [AC : Z99118 : AL009126 ] [GN :yrvN] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 15 of 21): from 

2795131to 3013540.] [NT:similar to hypothetical proteins] [LE:16472] [RE:17737] 
[DI : direct] 

NT AA 

ORF Name NTID AAID — — , _ score probab ility 
LengthLength c *- 



tf7503000982_24797827_c3_1514 ] | [ 4127 || 185 T] | 518 | | i242 || 1 . 8e-i2£ 



Description 

sp: [LN:OD01_BACSU] [AC:P23129] [GN : ODHA : CITK] [OR : BACILLUS SUBTILIS] [EC:1.2.4.2] 
[DE : KETOGLUTARATE DEHYDROGENASE)] [SP:P23129] [DB : swissprot] >pir : [LN: A32879] 
[AC:S25295:A32879:E69668:S14544 ] [PN : oxoglutarate dehydrogenase (lipoamide) , ] [GN : odhA 
] [CL: oxoglutarate dehydrogenase (lipoamide) : thiamine pyrophosphate -binding domain 
homology] [OR:Bacillus subtilis] [EC:1.2.4.2] [DB:pir2] >gp : [GI:g40003] [LN : BSODHA] 
[AC:X54805:S43328] [PN : oxoglutarate dehydrogenase (NADP+) ] [GN : odhA] [OR: Bacillus 
subtilis] [DB : genpept-bctl] [DE : B . subtilis odhA gene for 2 -oxoglutarate 
dehydrogenase.] [SP:P23129] [LE:291] [RE:3104] [DI:direct] >gp: [GI :ell85409:g2 634330] 
[LN:BSUB0011] [AC : Z99114 : AL009126 ] [PN : 2 -oxoglutarate dehydrogenase (El subunit) ] 
[GN : odhA] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:1.2.4.2] [DE:Bacillus subtilis 
complete genome (section 11 of 21): from 2000171to 2207900.] [NT : alternate gene name: 
citK] [SP:P23129] [LE:107873] [RE:110686] [DI : complement] 



NT AA 

ORF Name NTID AAID — — , , score probabili ty 
LengthLength c -L 



A17503000982_24800461_ci_1138 | |356 | [4128 | p?l~| | 75 | | 235 || i.7e-18 



Description 

sp: [LN:OD01_BACSU] [AC:P23129] [GN : ODHA : CITK] [OR:BACILLUS SUBTILIS] [EC:1.2.4.2] 
[DE : KETOGLUTARATE DEHYDROGENASE)] [SP:P23129] [DB : swissprot ] >pir : [LN: A32879] 
[AC:S25295:A32879:E69668:S14544 ] [PN : oxoglutarate dehydrogenase (lipoamide) , ] [GN: odhA 
] [CL: oxoglutarate dehydrogenase (lipoamide) : thiamine pyrophosphate -binding domain 
homology] [OR:Bacillus subtilis] [EC:1.2.4.2] [DB:pir2] >gp : [GI :g40003] [LN : BSODHA] 
[AC:X54805 :S43328] [PN : oxoglutarate dehydrogenase (NADP+) ] [GN : odhA] [OR:Bacillus 
subtilis] [DB : genpept-bctl] [DE : B . subtilis odhA gene for 2 -oxoglutarate 
dehydrogenase.] [SP:P23129] [LE:291] [RE: 3104] [DI:direct] >gp : [GI :ell85409:g2 634330] 
[LN:BSUB0011] [AC : Z99114 : AL009126] [PN : 2 -oxoglutarate dehydrogenase (El subunit)] 
[GN : odhA] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:1.2.4.2] [DE:Bacillus subtilis 
complete genome (section 11 of 21): from 2000171to 2207900.] [NT:alternate gene name: 
citK] [SP:P23129] [LE:107873] [RE:110686] [DI : complement] 
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ORF Name 



NTID 



NT 



AA 



AAID « _ — _ 

LengthLength 



score probability 



AI750300098a_24HOV790_cl_1048 | [357 | [ 4129 ||909 | [302 || 687 || 1.2e-g T 



Description 

sp: [LN:ISPA_BACST] [AC:Q08291] [OR:BACILLUS STEAROTHERMOPHILUS] [EC : 2 . 5 . 1 . 10] [DE:(FPP 
SYNTHASE) ] [SP:Q08291] [DB : swissprot] >pir : [LN : JX0257] [AC:JX0257 ] 
[PN:geranyltranstransf erase, : farnesyl- diphosphate synthase] 

[CL : geranyltranstransf erase] [OR : Bacillus stearothermophilus] [EC : 2 . 5 . 1 . 10] [DB : pir2] 
>gp: [GI:dl003054:g391610] [LN : BACFDPS] [AC:D13293] [PN: farnesyl diphosphate synthase] 

[OR:Bacillus stearothermophilus] [SR:Bacillus stearothermophilus (strain :ATCC10149) 
DNA] [DB : genpept - be t 1 ] [EC : 2 . 5 . 1 . 10] [DE : B . stearothermophilus DNA for farnesyl 
diphosphate synthase , complete cds . ] [LE:85] [RE:978] [DIrdirect] 



ORF Name 



NTID 



^ m NT AA 

AAID — _ — , . score 

LengthLength 



probability 



LAI7503000982 24823311 c3 1404 



] |358 | [4130 |pT0~ ||69 |[75 ||Q.0055 



Description 

pir: [LN:B21124] [AC:B21124 ] [PN:Bkm-like sex- determining region hypothetical protein 
CS314] [GN:Bkm ] [OR :Drosophila melanogaster] [DB:pir2] 



ORF Name 



NTID 



NT AA 
AAID — , » _ — , , score 
LengthLength 



probability 



17503000982 24886562 t3 771 



3^9 



|4i3i ||T2i | pr 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID _ . , . 



^£7503000982 24886587 c2 1175 



T5TT 



[4132 



LengthLength 
2448 



score probability 



3TT 



1981 ||2.3e-207 



Description 

pir: [LN:A69979] [AC:A69979 ] [PN : conjugation transfer protein homolog yrrC] [GNryrrC ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83977 :g2635193] [LN: BSUB0014] 
[AC:Z99117 :AL009126] [GN:yrrC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 14 of 21): from 2599451to 2812870.] 
[NT:similar to conjugation transfer protein] [LE:206106] [RE:208502] [DI : complement] 
>gp: [GI:ell83995:g2635211] [LN : BSUB0015] [AC : Z99118 : AL009126] [GN:yrrC] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
15 of 21): from 2795131to 3013540.] [NT:similar to conjugation transfer protein] 
[LE : 10426] [RE : 12822] [DI : complement] 



ORF Name 



NT AA 

NTID AAID , — , score 

LengthLength 



probability 



IAI7503000982 2542188 ci 1026 



Description 
NO-HIT 



|361 | [4133 |pnr~[ p~ 



ORF Name 



NTID 



^ m NT AA 

AAID — _ — , . score 

LengthLength 



probability 



IAI7503000982 25423425 r2 337 



[4134 j|ZTu | [5T 



Description 
NO-HIT 

ORF Name 



NT AA 
NTID AAID — , — , , score 
LengthLength 



probability 



E 



Description 
NO -HIT 



] [363 | [4135 |p^~~| [75 | 
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NT AA 

ORFName NTID MID Le ~ thLe ~ th score probability 



[Ar7503000982_25b022i7_c2J.lb7 | | 364 | [ 4135 | pB~| | 204 || 410 | | 2.7e-3U 

Description 

sp: [LN:RUVA_BACSU] [AC:O05392] [GN : RUVA] [OR:BACILLUS SUBTILIS] [DE : PROBABLE HOLL I DAY 
JUNCTION DNA HELICASE RUVA] [SP:O05392] [DB : swissprot] >pir : [LN : E69702] [AC:E69702 ] 
[PN:Holliday junction DNA helicase ruvA] [GN : ruvA ] [CL:holliday junction DNA helicase 
ruvA] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84023 :g2635239] [LN : BSUB0015] 
[AC:Z99118:AL009126] [PNrHolliday junction DNA helicase] [GN: ruvA] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 15 of 21) : 
from 2795131to 3013540.] [SP:O05392] [LE:40296] [RE:40901] [D I : complement] 

NT AA 

ORF Name NTID AAID — _ — . , score probabilit y 
LengthLength i 



^17503000982 25595186 c2 1177 



365 | [4137 |pi~- 1 [216 | | 736 | | 7.6e-73 



Description 

pir: [LN:G69728] [AC:G69728 ] [PN:uridine kinase udk] [GN:udk ] [CLiuridine kinase] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell83963 : g2635179] [LN : BSUB0014] 
[AC:Z99117:AL009126] [PN:uridine kinase] [GN:udk] [ FN : pyr imidine salvage] [ORrBacillus 
subtilis] [DB:genpept-bctl] [EC : 2 . 7 . 1 . 48] [DE: Bacillus subtilis complete genome 
(section 14 of 21): from 2599451to 2812870.] [LE:192038] [RE:192673] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ — , _ score probability 
LengthLength 



AI7503000982_25598818_c3_1420 | | 366 | [ 4138 |[TS5?]| 563 | [ 1195 ||1 . 7e-121 



Description 

sp: [LN:RECN_BACSU] [AC : P17894 : P19671] [GN : RECN] [OR:BACILLUS SUBTILIS] [DE: DNA REPAIR 
PROTEIN RECN (RECOMBINATION PROTEIN N) ] [SP: PI 78 94 : PI 9671] [DB : swissprot] 
>pir: [LN:B35128] [AC : B35128 : PS0054 : H69690 ] [PN:DNA repair and genetic recombination 
protein recN] [GN: recN ] [CL:recN protein] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl013258 :gl303923] [LN : BACJH642] [AC : D84432 :D82370] [PN : RecN] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] 
[DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 197319] 
[RE:199049] [DI:direct] >gp : [GI : gl43402] [LN : BACRECN] [AC:M30297] [OR:Bacillus 
subtilis] [SR:B. subtilis (strain 168) DNA] [DB : genpept-bctl] [DE :B . subtilis 
recombination and sporulation protein (recN, spoIVB) genes , complete cds, arginine 
hydroximate resistance (ahrC) gene, 3' end.] [NT : recombination protein (ttg start 
COdon)] [LE:131] [RE:1861] [DI:direct] >gp : [GI : ell85692 : g2634858] [LN : BSUB0013] 
[AC:Z99116 :AL009126] [GN : recN] [ FN : DNA repair and genetic recombination] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [DE: Bacillus subtilis complete genome (section 13 of 21) : 
from 2395261to 2613730.] [SP:P17894] [LE:124572] [RE:126302] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ T — , , score probability 
LengthLength c L 



A17503000982_25652217_c3^1470 | | 367 | [4139 | | 348 1 1 115 | 



Description 
NO-HIT 



162 



NT AA 

QRF Name ^ MI? Le^thLe^th ^^ probability 

{AI7S03000982Jtefc723J7_cl_1016 | |358 | [4140 ||7 80 1 1 259 || J71 || 3.5e-J4 | 

Description 

sp: [LN:YQXN__BACSU] [AC:P42095] [GN : YQXN: YQFI] [OR : BACILLUS SUBTILIS] [DE:(ORF3)] 
[SP:P42095] [DB : swissprot] >pir : [LN : H69968] [AC:H69968 ] [PN: conserved hypothetical 
protein yqxN] [GN :yqxN ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013162 : gl303827] 
[LN:BACJH642] [AC :D84432 :D82370] [PN:YqfI] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain : JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 109891] [RE: 110658] [DI : direct] 

>gp: [GI:ell85794:g2634960] [LN : BSUB0013] [AC : Z99116 : AL009126] [GN : yqxN] [FNrunknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis complete genome (section 

13 of 21): from 2395261to 2613730.] [NT : alternate gene name: yqfl; similar to 
hypothetical] [SP:P42095] [LE : 212962] [RE: 213729] [DI : complement] 

>gp: [GI:ell83758:g2634974] [LN : BSUB0014] [AC : Z99117 : AL009126] [GN : yqxN] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 

14 of 21): from 2599451to 2812870.] [NT : alternate gene name: yqfl; similar to 
hypothetical] [SP:P42095] [LE:8772] [RE: 9539] [DI : complement] 

NT AA 

QRF Name NTID AAID t — ^ t — , , score probability 
LengthLength c L ~ 

[&I7503000582_256802iS_c2_i20i | |369 | [4141 |p^~ | [241 | pg~]| ^> . 8e-40 "| 

Description 

sp : [LN : YQEZ_BACSU] [AC:P54465] [GN : YQEZ] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 46 . 5 
KD PROTEIN IN RPSU-PHOH INTEREGENIC REGION] [SP:P54465] [DB : swissprot] >pir : [LN: H69952] 
[AC:H69952 ] [PN: hypothetical protein yqeZ] [GN:yqeZ ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl013151:gl303816] [LN:BACJH642] [AC : D84432 : D823 70] [PN:YqeZ] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain :JH642 (trpC2 PheAl)) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 99826] 
[RE:101139] [DI:direct] >gp : [GI : ell83769 :g2634985] [LN: BSUB0014] [AC : Z99117 : AL009126] 
[GN:yqeZ] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 14 of 21): from 2599451to 2812870.] [SP:P54465] [LE:18291] 
[RE : 19604] [DI : complement] 



NT AA 

QRF Name NTID AAID — . — , , score probability 



LengthLength - 

&I75030P , 0982_250584g7_ci_i064 | |370 | [ 4142 | p^~| |70 | | 119 ||9.1e-07 



Description 

gp: [GI:g861340] [LN : CELE04F6] [AC:U28943] [GN:E04F6.7] [OR : Caenorhabditis elegans] 
[SR:Caenorhabditis elegans strain=Bristol N2] [DB : genpept -invl] [DE : Caenorhabditis 
elegans cosmid E04F6.] [NT:similar to ribitol dehydrogenase] [LE : 20096 : 20706 : 20893] 
[RE : 20659: 20814 : 21152] [DI : complement Join] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 

|AI7S03000982_26203942_c3_i3B8 | |371 | [ 4143 |pu~[ |209 ||g3g"~| |J . 0e-fc2 — | 

Description 

pir: [LN:A69969] [AC:A69969 ] [PN : conserved hypothetical protein yqzB] [GN : yqzB ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85791 :g2634957] [LN:BSUB0013] 
[AC:Z99116:AL009126] [GN : yqzB] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21) : from 2395261to 2613730.] 
[NT: similar to hypothetical proteins] [LE: 208975] [RE: 209613] [DI : complement] 
>gp: [GI:ell83755:g2634971] [LN: BSUB0014] [AC : Z99117 : AL009126] [GN:yqzB] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
14 of 21): from 2599451to 2812870.] [NT:similar to hypothetical proteins] [LE:4785] 
[RE: 5423] [DI : complement] 



163 



ORFJTame NTID AAI5 Le ^ thLe ^ th ^ore probability 



AI75030009a2__2620942! 3 _ci_1022 | [372 | [ 4144 || 106 F] |JbS | p7T|| 4 . 9e-1 7 2 



Description 

sp: [LN:RPSD_STAAU] [AC:P26766] [GN : RPOD : PLAC] [OR : STAPHYLOCOCCUS AUREUS] [DE : RNA 

POLYMERASE SIGMA FACTOR RPOD] [SP:P26766] [DB : swissprot ] >pir : [LN : S34442] [AC:S34442 ] 

[PN: transcription initiation factor sigma plaC] [GNrplaC ] [CL : transcription initiation 

factor sigma 43: transcription initiation factor sigma katF homology : transcription 

initiation factor sigma region 1 homology] [OR: Staphylococcus aureus] [DB:pir2] 

>gp: [GI:gl53069] [LN : STAPLAC] [AC:M63177] [PN: sigma factor] [GN:plaC] 

[OR staphylococcus aureus] [SR : Staphylococcus aureus (strain SA2 0) DNA] 

[DB : genpept-bctl] [DE:S. aureus sigma factor (plaC) gene, complete cds.] [LE:820] 

[RE: 1926] [DI :direct] 



NT AA 

ORF Name NTID AAID — ^,_ T — , , sco re probability 
LengthLength 



AI7503000982_26212501_c3_1394 | |373 | [4145 | p2~] [283 || 79 7 | | 2.6e-79 



Description 

gp: [GI :g5019735] [LN: AF104349] [AC : AF104349] [PN : hydrophobic membrane protein ZurM] 
[GN: zurM] [OR:Listeria monocytogenes] [DB : genpept-bct2] [DE:Listeria monocytogenes 
zinc-like uptake operon, complete sequence.] [LE:860] [RE: 1747] [DI:direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



IAI7503000982 26225463 cl 1070 1 \T7T 



[4146 



1741 I 



7.4e-82 



Description 

sp: [LN : RESD_BACSU] [AC:P35163] [GN : RESD] [OR:BACILLUS SUBTILIS] [DE : TRANSCRIPTIONAL 
REGULATORY PROTEIN RESD] [SP:P35163] [DB : swissprot] >pir : [LN:G69691] [AC :G6 9691 :S4 5559 
] [PN: two -component response regulator involved in aerobic and anaer resD] [GN: resD ] 
[CL:ompR protein: response regulator homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:g410141] [LN: BACDIA] [AC:L09228] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain 168, sub_species Marburg) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis spoVA to 
serA region.] [NT:ORFX17] [LE:21706] [RE:22428] [DI:direct] >gp : [GI : ell85581 : g2634 747] 
[LN:BSUB0013] [AC : Z99116 : AL009126] [PN : two -component response regulator] [GN: resD] 
[FN: activation role in global regulation of aerobic] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 13 of 21) : from 
2395261tO 2613730.] [NT : alternate gene name: ypxD] [SP:P35163] [LE:21155] [RE:21877] 
[DI : complement] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



AI75030009B2_26251577_ci_994 1 |375 | [ 4147 | 117 | ^T2~ || 2 . 8e-34 



Description 

Sp : [LN : YQEL_BACSU] [AC:P54457] [GN : YQEL] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 13.3 

KD PROTEIN IN AROD-COMER INTERGENIC REGION] [SP:P54457] [DB : swissprot] >pir : [LN : H69951] 
[AC:H69951 ] [PN:ybeB protein homolog yqeL] [GN:yqeL ] [CL : Escherichia coli ybeB 

protein] [OR:Bacillus subtilis] [DB:pir2] >gp: [GI :dl013128 :gl303793] [LN:BACJH642] 
[AC:D84432:D82370] [PN : YqeL] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 

region containing skin element.] [LE: 76417] [RE: 76773] [DI:direct] 

>gp: [GI:ell83792 :g2635008] [LN : BSUB0014] [AC : Z99117 : AL009126] [GN:yqeL] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
14 of 21): from 2599451to 2812870.] [NT:similar to hypothetical proteins] [SP:P54457] 
[LE : 42657] [RE : 43013] [DI : complement] 
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NT AA 

QRF Name NTID AAID Le ^ t kLength score probability 



A1 7 50:4000982J>62b96b7_ci_iu5i | | 3 7 6 | [4148 [p^F] |474 |[IT^"] 



J.Oe-117 



Description 

sp: [LN:DLD2_BACSU] [AC:P54533] [GN:BFMBC] [OR:BACILLUS SUBTILIS] [EC:1.8.1.4] 
[DE : DEHYDROGENASE) (LPD-VAL) ] [SP : P54533] [DB : swissprot] 

NT AA 

QRF Name NTID AAID — — score probability 



LengthLength - 

A17503tt00382J>6m7i3_c2_i332 | |377 | [4149 |pT7~] p05 || 580 ||2.6e-B6 



Description 

pir : [LN:S72642] [AC:S72642 ] [PN:probable ABC-type transport protein xynB] [GN: xynB ] 
[OR:Thermoanaerobacterium thermosulf urigenes] [DB:pir2] >gp : [GI : gl255237] [LN: TTU50952] 
[AC:U50952] [PN:XynB] [GNrxynB] [OR : Thermoanaerobacterium thermosulfurigenes] 
[DB:genpept-bctl] [DE : Thermoanaerobacterium thermosulfurigenes endoxylanase 
precursor (XynA) and membrane component of an ABC transporter (XynB) genes , complete cds 
and XynC (xynC) gene, partial cds.] [NT: Description: membrane component of an ABC] 
[LE:454] [RE:1359] [DI:direct] 

NT AA 

QRF Name NTID AAID — _ , — . score probab ility 
LengthLength c L 



AI750^Q00982^2g3g89b0_c2_1264 | |378 | [4150 | pgTT| | 459 | [7^^| |4 . 7 e- 7 5 



Description 

sp : [LN : RECQ_BACSU] [AC:P50729] [GN:RECQ] [OR:BACILLUS SUBTILIS] [EC:3.6.1.-] 
[DE : ATP -DEPENDENT DNA HELICASE RECQ, ] [SP:P50729] [DB : swissprot ] >pir : [LN : A69691] 
[AC:A69691 ] [PN : ATP -dependent DNA helicase homolog : ATP- dependent DNA helicase recQ] 
[GN : recQ ] [CL:DEAD/H box helicase homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell83747:g2634720] [LN : BSUB0012] [AC : Z99115 : AL009126] [PN : ATP -dependent DNA 
helicase] [GN: recQ] [OR: Bacillus subtilis] [DB : genpept-bctl] [EC: 3. 6.1.-] [DE: Bacillus 
subtilis complete genome (section 12 of 21): from 2195541to 2409220.] [SP:P50729] 
[LE:210616] [RE:212106] [DI : complement ] >gp: [GI :ell85571:g2 634737] [LN : BSUB0013 ] 
[AC:Z99116 :AL009126] [PN : ATP -dependent DNA helicase] [GN: recQ] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [EC: 3. 6.1.-] [DE: Bacillus subtilis complete genome (section 13 of 
21): from 2395261to 2613730.] [SP:P50729] [LE:10896] [RE:12386] [DI : complement] 
>gp: [GI :gll46200] [LN : BACSERA] [AC:L47648] [PN : DNA or RNA helicase, DNA-dependent 
ATPase] [GN:recS] [FN: DNA repair and homologous recombination] [OR: Bacillus subtilis] 
[DB : genpept-bct2] [DE:Bacillus subtilis phosphoglycerate dehydrogenase (serA) , 
ypaA, f erredoxin (fer) , ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glutamate dehydrogenase 
(ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, ypfB, cytidine 
monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphC, NAD+ 

dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes , complete cds.] 
[NT: similar to Escherichia coli recQ gene product;] [LE:4291] [RE: 5781] [DI: direct] 
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NT AA 

QRF Name NTID AAID thLe — th score probability 



|AI7B03000982JJfcJB4feB2_clJ.092 | |3 7 9 | {4151 | |1248 1 1415 ||2.6e-!>4 

Description 

spr [LNrYPIA_BACSU] [AC:P54389] [GN : YPIA] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 48 . 3 
KD PROTEIN IN QCRA-AROE INTERGENIC REGION] [SP:P54389] [DB : swissprot] >pir : [LN : E69936] 

[AC:E69936 ] [PN : conserved hypothetical protein ypiA] [GNrypiA ] [CL : tetratricopeptide 
repeat homology] [ORrBacillus subtilis] [DB:pir2] >gp : [GI :gll46224] [LN : BACYPIA] 

[AC:L47709] [GNrypiA] [FN : hypothetical] [ORrBacillus subtilis] [DB :genpept-bctl] 

[DErBacillus subtilis (clone YAC15-6B) ypiABF genes, qcrABC genes , ypjABCDEFGHI genes, 
birA gene, panBCD genes, dinG gene, ypmB gene,aspB gene, asnS gene, dnaD gene, nth gene 
and ypoC gene, completecds ' s . ] [NT : puta t i ve ] [LE:348] [RE:1619] [DI:direct] 
>gp: [GI :ell83704 :g2 634677] [LN : BSUB0012] [AC : Z99115 : AL009126] [GN : ypiA] [FNrunknown] 

[ORrBacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete genome (section 
12 of 21) r from 2195541to 2409220.] [NTrsimilar to hypothetical proteins] [SPrP54389] 

[LE r 170048] [RE r 171319] [DI r complement] 

NT AA 

QRF Name NTID AAID — — . , score probability 

LengthLength c -L 



A17S03000962_26S97ia6_cl_10S5 | |380 | [4152 |p^"] |147 ||544 ||1.7e-B2 



Description 

spr [LNrYQIW_BACSU] [AC:P54534] [GNrYQIW] [OR r BACILLUS SUBTILIS] [DE r HYPOTHETICAL 16.2 
KD PROTEIN IN BMRU-ANSR INTERGENIC REGION] [SPrP54534] [DB r swissprot] >pir r [LN r E69962] 
[ACrE69962 ] [PN r conserved hypothetical protein yqiW] [GNryqiW ] [ORrBacillus subtilis] 
[DBrpir2] >gpr [GI : dl013283 rgl303948] [LN r BACJH642] [AC r D84432 r D82370] [PNrYqiW] 
[ORrBacillus subtilis] [SRrBacillus subtilis (strain r JH642 (trpC2 PheAl) ) DNA] 
[DBr genpept-bctl] [DErBacillus subtilis DNA, 2 83 Kb region containing skin element.] 
[LEr226148] [REr226585] [DIrdirect] >gp r [GI r ell8 56 67 rg2 634833] [LN r BSUB0013] 
[ACrZ99116 rAL009126] [GNryqiW] [FNrunknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DErBacillus subtilis complete genome (section 13 of 21) r from 2395261to 2613730.] 
[NTrsimilar to hypothetical proteins from B. subtilis] [LEr 97034] [RE r 97471] 
[DI r complement] 



NT AA 

QRF Name NTID AAID — _ — . score probability 
Length Length 



Al7S03000962_26600137_t2_493 | pgi | [4153 |pi~|| 176 || 188 | 



8.9e-15 



Description 

sp r [LN r YPUF_BACSU] [ACrP17617] [GN: YPUF] [OR r BACILLUS SUBTILIS] [DE r HYPOTHETICAL 21.0 
KD PROTEIN IN RIBT-DACB INTERGENIC REGION (ORFX6 ) ] [SP:P17617] [DB r swissprot] 
>pirr [LN:S45548] [AC r S45548 r E6 9942 ] [PN r hypothetical protein ypuF] [GNrypuF ] 
[ORrBacillus subtilis] [DBrpir2] >gp r [GI r g410130] [LNrBACDIA] [AC:L09228] [ORrBacillus 
subtilis] [SRrBacillus subtilis (strain 168, sub_species Marburg) DNA] 
[DBrgenpept-bctl] [DErBacillus subtilis spoVA to serA region.] [NT:ORFX6] [LErl2218] 
[RErl2742] [DI r complement] >gp : [GI :g580916] [LNrBSRIB] [ACrX51510] [ORrBacillus 
subtilis] [DBrgenpept-bctl] [DE r B . subtilis riboflavin biosynthesis operon ribG, ribB, 
ribA, ribH , and ribT genes.] [NTrORF Y (AA 1-174)] [SP:P17617] [LEr5164] [REr5688] 
[DI r complement] >gp r [GI r ell85592 r g2634758] [LNrBSUB0013] [AC : Z99116 r AL009126] [GNrypuF] 
[FNrunknown] [ORrBacillus subtilis] [DBrgenpept-bctl] [DErBacillus subtilis complete 
genome (section 13 of 21) r from 2395261to 2613730.] [SPrP17617] [LEr30841] [RE:31365] 
[DI r direct] 
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NT AA 

ORF Name NT ID AAID Le ^ t frLength score probability 



AI75030009B2J^7i>2Ji2_c3_1396 | | 382 | [4154 ||ivr~| |S6 ||229 ||4.0e-19 



Description 

Sp: [LN: RL33__BACST] [AC:P23375] [GN : RPMG] [OR : BACILLUS STEAROTHERMOPHILUS] [DE:50S 
RIBOSOMAL PROTEIN L3 3] [SP:P23375] [DB : swissprot] >pir : [LN: B48396] [AC:B48396 ] 
[PN: ribosomal protein L33] [CL : Escherichia coli ribosomal protein L33] [OR:Bacillus 
s tearothermophilus ] [DB : pir2 ] 

NT AA 

ORF Name NTID AAID , — ^, , — ^, score probability 
LengthLength ^ 



AI75030009a2_26754652_c3_i454 | |383 | [4155 | pT" 



514 ||2.Se-49 



Description 

sp: [LN:KCY_BACSU] [AC:P38493] [GN : CMK : JOFC] [OR:BACILLUS SUBTILIS] [EC : 2 . 7 . 4 . 14] 
[DE: (CMP KINASE) ] [SP:P38493] [DB : swissprot] >pir : [LN : F69601] [AC:F69601 ] 
[PN:cytidylate kinase cmk] [GN:cmk ] [CL : cytidylate kinase cmk] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI:g533105] [LN : BSU11687] [AC:U11687] [PN:unknown] [GN:jofC] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 168 jofA, jofB, MssA 
homolog (jofC) and ribosomalprotein SI homolog (jofD) genes, complete cds, and joeB 
gene, partial cds.] [NT: similar to the Escherichia coli mssA gene product] [LE:1744] 
[RE:2418] [DIrdirect] >gp: [GI : ell83 734 : g2634707] [LN : BSUB0012 ] [AC : Z99115 : AL009126] 
[PN: cytidylate kinase] [GN:cmk] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 7 . 4 . 14] 
[DE:Bacillus subtilis complete genome (section 12 of 21): from 2195541to 2409220.] 
[NT:alternate gene name: jofC, ypfC] [SP:P38493] [LE:199742] [RE:200416] 
[DI: complement] >gp: [GI :ell85558:g2 634724] [LN: BSUB0013] [AC : Z99116 : AL009126] 
[PN: cytidylate kinase] [GN:cmk] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 7 . 4 . 14] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT:alternate gene name: jofC, ypfC] [SP:P38493] [LE:22] [RE:696] [DI : complement] 
>gp: [GI : gll46214] [LN : BACSERA] [AC:L47648] [PN:cytidine monophosphate kinase] [GN:cmk] 
[FN:cytidine diphosphate biosynthesis] [OR:Bacillus subtilis] [DB : genpept -bet 2 ] 
[DE:Bacillus subtilis phosphoglycerate dehydrogenase (serA) , ypaA, f erredoxin (fer) , 
ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glutamate dehydrogenase (ypcA) , ypdA, ypdB, 
ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, ypfB, cytidine monophosphatekinase 
(cmk) , ypfD, ypgA, yphA, yphB, yphC, NAD+ dependentglycerol- 3 -phosphate dehydrogenase 
(glyc) , yphE and yphF genes, complete cds.] [NT:similar to Escherichia coli smba 
supress; putative] [LE: 15981] [RE: 16655] [DI: direct] 

. tn.imi.L mill.— u,K ...■.■»j j ..» j f..«»ii| I ..u^iii.i..i.. ■ .. ... .iiim.-... ...|.|...|.|...... ■■...,»... ........ . m ...I. ii I.. ■ ., ■■ ...in. .. .., ,..,,.„ .. I .„ ) , u i,„i.„ 1 „ w »„,.. lll i, ■I..-,- ■ .I..II.IIH ..I.... m , r m., ........ , ......in. .i I. n i.iiii . 

NT AA 

ORF Name NTID AAID — — . , score probabili ty 
LengthLength 



AI7S03000982_2fi7S47BO_c2J.i92 | |384 | | 4i56 |[TT5T] p87 | p5 | | l.le 



Description 

pir: [LN:E69981] [AC:E69981 ] [PN:NifS protein homolog homolog yrvO] [GN:yrvO ] 
[CL:nitrogen fixation protein nifS] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell83981:g2635197] [LN:BSUB0014] [AC : Z99117 : AL009126] [GN:yrvO] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
14 of 21): from 2599451to 2812870.] [NT:similar to NifS protein homolog] [LE:210475] 
[RE:211515] [DI : complement] >gp : [GI :ell83999:g2 635215] [LN : BSUB0015] 

[AC:Z99118 :AL009126] [GN:yrvO] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[NT: similar to NifS protein homolog] [LE: 14795] [RE: 15835] [DI : complement] 
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ORF Name 



NT AA 
NT ID AAID — T — , . score 
LengthLenqth 



probability 



IAI7503000982 26772iJb cl 1000 



] [385 | [ 415 7 [[gB "| [331 | [55 5 | | l.le-53 



Description 

sp: [LN:YQEN_BACSU] [AC:P54459] [GN: YQEN] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 40 . 5 
KD PROTEIN IN COMEC-RPST INTERGENIC REGION] [SP:P54459] [DB : swissprot] >pir : [LN : B69952] 
[AC:B69952 ] [PN: conserved hypothetical protein yqeN] [GNryqeN ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl013134 : gl303799] [ LN : BAC JH6 4 2 ] [AC : D84432 : D823 70] [PNrYqeN] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain: JH642 (trpC2 PheAl) ) DNA] 
[DB:genpept-bctl] [DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] 
[LE:82475] [RE:83518] [DI: direct] >gp : [GI :ell83786 :g2 635002] [LN : BSUB0014] 
[AC:Z99117:AL009126] [GN:yqeN] [FN:unknovm] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 14 of 21): from 2599451to 2812870.] 
[NT: similar to hypothetical proteins] [SP:P54459] [LE: 35912] [RE: 36955] [DI : complement] 



ORF Name 



NT ID 



NT AA 
AAID — — , , score 
LengthLength 



E 



27503000982 26774062 c2 1248 



[386 | [4158 ||i332~|pT 



probability 
1670 ||8.0e-172 



Description 

sp: [LN:YPHC_BACSU] [AC:P50743] [GN : YPHC] [OR:BACILLUS SUBTILIS] [DE : REGION] [SP:P50743] 
[DB: swissprot] >pir: [LN:A69936] [AC:A69936 ] [PN: conserved hypothetical protein yphC] 
[GN:yphC ] [CL: Mycobacterium leprae probable GTP-binding protein : translation elongation 
factor Tu homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : el!83729 : g2634702] 
[LN:BSUB0012] [AC : Z99115 : AL009126 ] [GN:yphC] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 12 of 21) : from 
2195541to 2409220.] [NT:similar to hypothetical proteins] [LE:193903] [RE: 195213] 
[DI: complement] >gp : [GI : gll46219] [LN : BACSERA] [AC:L47648] [GN:yphC] [OR:Bacillus 
subtilis] [DB:genpept-bct2] [DE:Bacillus subtilis phosphoglycerate dehydrogenase 
(serA) , ypaA, ferredoxin (fer) , ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glutamate 
dehydrogenase (ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, 
ypfB, cytidine monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphC, NAD+ 
dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes , complete cds.] 
[NT: similar to Escherichia coli GTP-binding protein] [LE: 21184] [RE: 22494] [DI: direct] 



ORF Name 



U.7503000982 2756288 c2 1257 



Description 
NO-HIT 



ORF Name 



E 



17503000983 10000128 t2 399 



Description 
NO-HIT 



NT AA 
NT ID AAID — _ — _ score 
LengthLength 



probability 



2 [387 | [4159 |[Tg3~||50 | 



NT ID AAID 



NT 



AA 



LengthLength 



score 



probability 



[388 [ [4160 ||T5g— | p" 
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NT AA 

ORFName NTID AAID Le ~ thLe ~ th score probability 



AiyB03000983J.019S942_c3J.0S6 | [389 | [ 4161 | p5~| | 2il | p^~| |l . ^e-21 



Description 

sp: [LN:TRPF_THEMA] [AC:Q56320] [GN : TRPF] [OR : THERMOTOGA MARITIMA] [EC : 5 . 3 . 1 . 24] 
[DE:N- (5 1 - PHOSPHOR I BOS YL ) ANTHRANI LATE ISOMERASE, (PRAI)] [SP:Q56320] [DB : swissprot ] 
>pir: [LN:S59048] [AC:S59048 ] [PN: phosphor ibosyl anthranilate isomerase trpF] [GN; trpF 
] [CL:phosphoribosylanthranilate isomerase : trpF homology] [OR : Thermo toga maritima] 
[DB:pir2] >gp : [GI :gl054860] [LN: TMTRABCDF] [AC:X92729] [PN : phosphoribosyl anthranilate 
isomerase] [GN : trpF] [ORrThermotoga maritima] [DB :genpept-bctl] [DE :T .maritima trpD, 
trpC # trpF, trpB, and trpA genes.] [SP:Q56320] [LE:1012] [RE: 1629] [DI:direct] 
>gp: [GI:g4980631] [LN : AE001699] [AC : AE001699 : AE000512] [PN : phosphoribosylanthranilate 
isomerase] [GN:TM0139] [OR : Thermo toga maritima] [DB :genpept-bct2] [DE : Thermotoga 
maritima section 11 of 136 of the complete genome.] [NT: similar to PID: 1054860 
SP:Q56320 GB:AE000512] [LE:7969] [RE:8586] [D I : complement] 



NT AA 

ORF Name NTID AAID — • — , _ score proba bility 
LengthLength c z - 

|M75030005a3_i0553765_t2_3S7 | [390 | [ 4162 || 19S 1 1 54 | 

Description 
NO-HIT 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

AI7S03000983J0Bfl463_c3J78 | [391 | [4153 |[ZT2T] | 706 | [ 2472 |[8T3 e-2bV 



Description 

pir: [LN:S70691] [AC : S70691 : C6 9680 ] [PN : polyribonucleotide nucleotidyltransferase, 
alpha chain pnpA: polynucleotide phosphorylase pnpA] [GN:pnpA ] [CL : polyribonucleotide 
nucleotidyltransferase alpha chain] [OR:Bacillus subtilis] [EC:2.7.7.8] [DB:pir2] 
>gp: [GI:ell85260:g2634041] [LN: BSUB0009] [AC : Z99112 : AL009126] [PN polynucleotide 
phosphorylase (PNPase) ] [GN : pnpA] [FN:necessary for competence development] 
[OR:Bacillus subtilis] [DB : genpept -be t 1 ] [EC:2.7.7.8] [DE:Bacillus subtilis complete 
genome (section 9 of 21): from 1598421to 1807200.] [NT:alternate gene name: comR] 
[SP:P50849] [LE:140354] [RE:142471] [DI: direct] >gp : [GI : gll8468 0] [LN : BSU29668] 
[AC:U29668] [PN : polynucleotide phosphorylase] [GN :pnpA] [OR:Bacillus subtilis] 
[DB: genpept -bet 2] [DE:Bacillus subtilis ribosomal protein RpsO (rpsO) gene, partial 
cds,and polynucleotide phosphorylase (pnpA) gene, complete cds.] [LE:224] [RE:2341] 
[DI: direct] 

NT AA 

ORF Name NTID AAID — _ — , _ score probability 
LengthLength c 



AI7S030009S3J06S792b_c3J017 | | 392 | [4154 |pw-| |3iS | pi~]| i . 4e-55 



Description 

pir: [LN:H69722] [AC :H6 9722 :B25364 ] [PN : homos erine kinase,] [GN:thrB ] [CL : homoserine 
kinase thrB] [OR:Bacillus subtilis] [EC : 2 . 7 . 1 . 39] [DB:pir2] >gp : [GI : ell84303 :g2635721] 
[LN:BSUB0017] [AC : Z99120 : AL009126] [PN : homoserine kinase] [GN:thrB] [FN: threonine 
biosynthesis] [OR:Bacillus subtilis] [DB : genpept -bet 1] [EC : 2 . 7 . 1 . 39] [DE:Bacillus 
subtilis complete genome (section 17 of 21): from 3197001to 3414420.] [NT : alternate 
gene name: thrA] [LE: 114902] [RE: 115831] [DI : complement] 

NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength • L - 

p7503000983_10g57002_t2_278 | |393 | [ 4155 1 1 62 | 

Description 
NO-HIT 
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ORF Name 



NT AA 
NT ID AAID — « _ — . . score 
LenqthLength 



probability 



[AI7503000983 1074090 cl 728 



j {394 | [4166 |pg— 1|301 |p7~]| 1.0e-68 - 



] 



Description 

sp: [LN:CODV_BACSU] [AC:P39776] [GN:CODV] [OR:BACILLUS SUBTILIS] [DE : PROBABLE 

INTEGRASE / RECOMB INASE CODV] [SP:P39776] [DB : swissprot] >pir : [LN : G69601] 

[AC:G69601:S61493 :S72309 ] [PN : integrase/recombinase codV] [GNrcodV ] [CL iprobable 

site-specific integrase/recombinase XerC] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:g535348] [LN:BSU13634] [AC:U13634] [PNrCodV] [GNrcodV] [OR: Bacillus subtilis] 

[DB : genpept-bctl] [DE: Bacillus subtilis JH642 dipeptide permease operon regulators, 

codV,codW, codX, and codY genes, complete cds . ] [LE:293] [RE: 1207] [DI:direct] 

>gp: [GI:ell85205:g2633986] [LN: BSUB0009] [AC : Z99112 : AL009126] 

[PN: integrase/recombinase] [GN:codV] [OR: Bacillus subtilis] [DB: genpept-bctl] 

[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 

[SP:P39776] [LE:88166] [RE:89080] [DI: direct] 



ORF Name 



NT ID 



NT 



AA 



AAID 

LengthLength 



score 



IAI7503000983 10975428 t2 340 



][ 



[4157 jjgvg 1 |292 |pr 



probability 

mm 



Description 

sp : [ LN : B I CD_DROME ] [AC:P16568] [GN : BICD] [OR : DROSOPHILA MELANOGASTER] [SR:, FRUIT FLY] 
[DE : CYTOSKELETON-LIKE BICAUDAL D PROTEIN] [SP:P16568] [DB : swissprot] >pir : [LN : A33636 ] 
[AC:A33636 ] [PN:bicaudal protein D] [GN:BicD ] [OR : Drosophi la melanogaster] [DB:pir2] 
>gp: [GI :gl57006] [LN : DROBICD] [AC:M31684] [OR : Drosophila melanogaster] 
[SR:D. melanogaster (strain DfTW119) embryo, cDNA to mRNA] [DB : genpept-invl] 
[DE :D. melanogaster cytoskeleton-like bicaudalD protein (BicD) mRNA, complete cds.] 
[NT:bicaudalD protein] [LE:132] [RE:2480] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID — — 
LengthLength 



H7503000983 1179775 tl 70 



[4158 |pTT2 | |3T" 



score probability 
110.029 



Description 

pir: [LN:S78676] [AC : S78676 : S78677 ] [PN : hypothetical protein Q0144] [OR : mitochondrion 
Saccharomyces cerevisiae] [DB:pir2] >gp: [GI : el370708 :g4160384] [LN: SCE011856] 
[AC:AJ011856] [OR : Mitochondrion Saccharomyces cerevisiae] [SR:baker's yeast] 
[DB:genpept-plnl] [DE : Saccharomyces cerevisiae complete mitochondrial genome.] [NT : ORF 
Q0144] [LE:54109] [RE:54438] [DI: direct] 



ORF Name 



NT AA 

NTID AAID — _ , . score 

LengthLength 



probability 



IA17503000983 11817625 EI 65 



[4169 



T25 I Wl 



Description 
NO-HIT 



ORF Name 



NTID 



p7503000983_il83337_c2_847 



][ 



NT AA 
AAID — , — , 
LengthLength 

[4170 



score probability 
1303 ||5.8e-27 



Description 

sp: [LN:NUSA_BACSU] [AC:P32727] [GN: NUSA] [OR:BACILLUS SUBTILIS] [DE : N UTILIZATION 
SUBSTANCE PROTEIN A HOMOLOG (NUSA PROTEIN) ] [SP:P32727] [DB : swissprot] >pir : [LN : C36905] 
[AC:C36905:B69668:S31991 ] [PN : transcription termination-antitermination factor nusA] 
[GN : nusA ] [CL: Bacillus transcription termination factor nusA: transcription 
termination factor nusA homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : g49316] 
[LN : BSORF1T7A] [AC:Z18631] [GN:ORF2] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis infB-nusA operon.] [SP:P32727] [LE:961] [RE:2076] [DI:direct] 



170 



ORF Name 

IAIVB030009B3 11U44U02 c2 941 



Description 
NO-HIT 



ORF Name 



IAI7503000983 11926527 t3 614 



Description 
NO-HIT 



NT AA 
NT ID AAID — , — _ score 
LengthLength 



probability 



] [399 | [4171 |p^7~| |108 | 



NT ID 



4ocr 



j. _ NT AA 

AAID — — , . score 

LengthLength 



probability 



[4172 |[T7T | [FT" 



ORF Name 



NT 



AA 



NTID AAID — , 

LengthLength 



IAI7503000983 11930317 c3 974 



][ 



[4173 | [TUT 



score probability 
1.9e-21 



Description 

sp: [LN:YLXQ_BACSU] [AC:P32729] [GN: YLXQ] [OR : BACILLUS SUBTILIS] [DE : PROBABLE RIBOSOMAL 
PROTEIN IN NUSA-INFB INTERGENIC REGION (ORF4) ] [SP:P32729] [DB : swissprot] 
>pir: [LN:E36905] [AC : E36905 : C6 9882 : S31993 ] [PN : ribosomal protein L7AE family homolog 
ylxQ: hypothetical protein 2 (infB 5' region)] [GN:ylxQ ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI :g49318] [LN: BSORF1T7A] [AC:Z18631] [GN: ORF4] [OR : Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis infB-nusA operon.] [SP:P32729] [LE:2367] [RE:2669] 
[DI:direct] >gp: [GI : ell85253 :g2634034] [LN : BSUB0009] [AC : Z99112 : AL009126] [GNrylxQ] 
[FNrunknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete 
genome (section 9 of 21): from 1598421to 1807200.] [NT:alternate gene name: ymxC; 
similar to ribosomal] [SP:P32729] [LE:134658] [RE:134960] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID — _ — ^, score probability 
LengthLength 



IAI7503000983 1212785 ri 108 



|402 | [4174 |pT3 — [|70 ||74 ||0.048 



Description 

pir: [LN:G70047] [AC:G70047 ] [PN : conserved hypothetical protein yvrM] [GN:yvrM ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84405 : g2635823] [LN : BSUB0017] 
[AC:Z99120 :AL009126] [GN:yvrM] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 17 of 21): from 3197001to 3414420.] 
[NT: similar to hypothetical proteins] [LE: 214190] [RE: 214951] [DI : complement] 
>gp: [GI:ell86014 :g2635839] [LN: BSUB0018] [AC : Z99121 : AL009126 ] [GN : yvrM] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
18 of 21): from 3399551to 3609060.] [NT:similar to hypothetical proteins] [LE:11640] 
[RE : 12401] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — _ — _ score 
LengthLength 



probability 



17^03000983 12694082 c2 889 



j |403 | [4175 |[192 ||53 | 



Description 
NO-HIT 



ORF Name 



AI7503000983_12703763_r3_592 

Description 

NO-HIT 



NTID 



™ TT% NT AA 

AAID — — score 

LengthLength 



probability 



[4175 



Ilb9 I 



FT" 



171 



NT AA , 
ORF Name NTID AAID Le ~ thLe ^ t fr SCQre probability 



|Ai7503000983J.iUa7i>lJ_c2J)07 | | 405 | [41 7 7 ||3033||1010 || 417 || 4.1e-6b 

Description 

pir: [LN:D71302] [AC:D71302 ] [PN:probable exonuclease (sbcC) ] [GN:TP0627 ] 
[ORrTreponema pallidum subsp. pallidum] [SR:, syphilis spirochete] [DB:pir2] 

>gp: [GI :g3322922] [LN : AE001237] [AC : AE001237 : AE000520] [PN : exonuclease (sbcC) ] 
[GN:TP0627] [OR : Treponema pallidum] [DB : genpept-bct2] [DE : Treponema pallidum section 53 

of 87 of the complete genome.] [NT:similar to SP:P13458 GB:X15981 PID:145327 PID:42914] 
[LE: 12418] [RE:15561] [DI: direct] 



ORF Name 


NTID 


NT AA 

AAID 

Length Length 


score 


probability 


AI7503000983_i3i7%92_ci_724 


|40S 


|4i78 ||132 ||43 | 






Description 
NO -HIT 










ORF Name 


NTID 


^ TT% NT AA 

AAID , — , 

Length Length 


score 


probability 


AI7503000983J.3609S8_c2_905 


ir 


| |4179 ||270 | |89 | 


|109 ||2 


.le-06 



Description 

pir: [LN:C69891] [AC : C69891 : S574 06 ] [PN:yne J protein: protein 160] [GN:yneJ ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e249647 :gl405451] [LN : BC170DEGR] [AC:Z73234] 
[PN:YneJ] [GN:yneJ] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE :B . subtilis DNA (26.2 

kb fragment; 170 degree region).] [NT : identical to yoxl (from acc . no. X87845) ] 
[SP:P45710] [LE:6147] [RE:6638] [DI:direct] >gp : [GI : ell83454 : g2634179] [LN : BSUB0010] 
[AC:Z99113 :AL009126] [GN:yneJ] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 10 of 21): from 1781201to 2014980.] 
[NT: alternate gene name: yoxl] [SP:P45710] [LE: 142526] [RE: 143017] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength J - 



IAI7503000983 13678135 t3 575 



[4180 |p^~||52 ||84 ||0. 00093 



Description 

pir: [LN:B71622] [AC:B71622 ] [PN:metal binding protein (DHHC domain) PFB0140w] 
[GN:PFB0140w ] [OR : Plasmodium falciparum] [DB:pir2] >gp : [GI :g3845106] [LN : AE001375] 
[AC:AE001375:AE001362] [PN:metal binding protein (DHHC domain)] [GN: PFB0140w] 
[OR Plasmodium falciparum] [SR:malaria parasite P. falciparum] [DB : genpept - inv2 ] 
[DE : Plasmodium falciparum chromosome 2, section 12 of 73 of thecomplete sequence.] 
[NT: identified by sequence similarity; putative] [LE:814] [RE:1050] [DI:direct] 



NT AA 

ORF Name NTID AAID „ — _„ — , , score probability 
LengthLength c L - 



p7503000983_13678462^fl_15 | |409 | [ 4181 ||BST~| | ifl7 | pT"~|| S . 8e-27 "| 

Description 

sp: [LN:SPHX_SYNP7] [AC:P39665] [GN:SPHX] [OR : SYNECHOCOCCUS SP] [SR:PCC 7942 , ANACYSTIS 
NIDULANS R2] [DE:SPHX PROTEIN PRECURSOR] [SP:P39665] [DB : swissprot] >pir : [LN : S544 02] 
[AC:S54402 ] [PN:SphX protein] [CL:sphX protein] [OR : Synechococcus sp . ] [DB:pir2] 
>gp: [GI:dl005689:g496319] [LN:SYOSPHX] [AC:D26161] [PN:SphX] [GN:sphX] 
[OR: Synechococcus sp.] [SR : Synechococcus sp. (strain : PCC7942 ) DNA] [DB : genpept -bet 1] 
[DE : Synechococcus sp. sphX gene for phosphate regulon SphX, completecds . ] [LE:317] 
[RE: 1330] [DI:direct] 



NT AA 

ORF Name NTID AAID — - — , , score probability 
LengthLength c 



AI7503000983_13^91280_tl_42 

Description 

NO-HIT 



410 | |4182 ||TT5 1 144 j 
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NT AA 

QRF Name SITO MID Le — thLe — th score probability 



Al7B030009S3_iJB4bJ0U_t2jm | | 41i | [4183 [ 102 [[73 [[ 0,0055 



Description 

sp: [LN:YCIB_BUCAP] [AC:P42397] [OR :BUCHNERA APHIDICOLA] [DE : HYPOTHETICAL 21.4 KD 
PROTEIN IN TRPA 3'REGION] [SP:P42397] [DB : swissprot ] >pir : [LN : F49897] [AC : F49897 : S36433 
] [PN:protein VI (trpA 3 ' -region) ] [OR:Buchnera aphidicola] [DB:pir2] >gp : [GI : g396663] 
[LN : BATRYOPEA] [AC:Z19055] [PN:ORF 6] [OR:Buchnera aphidicola] [DB : genpept-bctl] 
[DE:B. aphidicola tryptophan operon.] [NT : homologous to E. coli ORF6 located downstream 
of] [SP:P42397] [LE:7540] [RE:8073] [Dlrdirect] 



NT AA 

QRF Name NT ID AAID Le ~ thLe ~ th score probability 

|&I7S030009fl3J.38762i6_c3_1046 | [412 [ |4184 |[XuTT| [ 340 ||gg5~|| l . le-6S — | 

Description 

gp: [GI:ell81777:g2632149] [LN: SCY14029] [AC:Y14029] [PN : antiterminator] [GNrglcT] 
[OR: Staphylococcus carnosus] [DB : genpept-bctl] [DE : Staphylococcus carnosus glcT gene.] 
[LE:213] [RE: 1076] [DI:direct] 

NT AA 

QRF Name NT ID AAID — — , , score probab ility 
Length Length ^ L - 



|A17S03000583_i42B1643J:3_B47 | [413 | [ 4185 |pOT~| |102 | pn~~ | |B . 2e- 10 — | 

Description 

pir: [LN:H69891] [AC:H69891 ] [PN : hypothetical protein yneR] [GN:yneR ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : e249654 :gl405458] [LN : BC170DEGR] [AC:Z73234] [PN : YneR] 
[GN:yneR] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis DNA (26.2 kb 
fragment; 170 degree region).] [LE:13221] [RE:13508] [DI : complement] 

>gp: [GI:ell83464:g2634189] [LN : BSUB0010] [AC : Z99113 : AL009126] [GN:yneR] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
10 of 21): from 1781201to 2014980.] [LE:149600] [RE:149887] [D I : complement] 

NT AA 

QRF Name NTID AAID — — , , score probability 
Length Length -L - 



AI7B03000983_144B7876_i3_b21 | [414 | [4185 — | [44 | [I53~~p . Ie-1B 



Description 

gp: [GI:gl022726] [LN: SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT : ORF1] 

[LE:1101] [RE: 1922] [DI : complement] 

NT AA 

QRF Name NTID AAID — — , , score probabil ity 
Length Length L 



AI7S030009B3J.447193B_c3JL027 | [415 | [ 4187 || 183 ||60 | p71~|| 2 . 5e-19 



Description 

sp: [LN:RL33_BACST] [AC:P23375] [GN: RPMG] [OR:BACILLUS STEAROTHERMOPHILUS] [DE:50S 
RIBOSOMAL PROTEIN L3 3] [SP:P23375] [DB : swissprot] >pir : [LN : B48396 ] [AC:B48396 ] 
[PN:ribosomal protein L33] [CL : Escherichia coli ribosomal protein L33] [OR:Bacillus 
stearothermophilus] [DB:pir2] 



173 



NT AA 

ORFJiame NTID AAI3 Le ~ thLe ~ th score probability 



|AI7B030009B3_146ibBBJ:lJ.6 | |416 | [ 4188 [IT3 1| 712 [ |2Tg e-70 

Description 

pir : [LN:D69419] [AC:D69419 ] [PN : phosphate ABC transporter, permease protein (pstC) 

homolog] [CL:phoW protein] [OR : Archaeoglobus fulgidus] [DB:pir2] >gp : [GI :g2649218] 
[LN:AE001010] [AC:AE001010 :AE000782] [ PN : phosphate ABC transporter, permease protein] 
[GN:AF1357] [OR : Archaeoglobus fulgidus] [DB : genpept - bet 2 ] [DE : Archaeoglobus fulgidus 

section 97 of 172 of the complete genome.] [NT: similar to SP:P46339 PID: 903304 

PID:1303855] [LE:5294] [RE:6187] [DI: direct] 



NT AA 

ORF Name NTID AAID — — score probability 



Length Length - 

AI7B030005fl3jL46344SOJ:lJ. | |4i7 | [4189 |puT^ [57 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength - 

AI75030009B3_147233B7_ca_904 | |418 [ [4190 | 93 | p3u~|| 2 . 4e-21 



Description 

sp : [LN : YNEF_BACSU] [AC:P45708] [GN: YNEF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 8.3 KD 

PROTEIN IN TTK-CCDA INTERGENIC REGION] [SP:P45708] [DB : swissprot ] >pir : [LN: S57403] 
[AC:S57403 :A69891 ] [PN : hypothetical protein yneF] [GN:yneF ] [CL : conserved 

hypothetical protein yneF] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e249644 :gl405448] 
[LN: BC170DEGR] [AC:Z73234] [PN : YneF] [GN:yneF] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:B. subtilis DNA (26.2 kb fragment; 170 degree region).] [NT : identical to yoxG (from 

acc. no. X87845)] [SP:P45708] [LE:4225] [RE:4443] [DI:direct] >gp : [GI : g870924] 
[LN : BSTKTCCDA] [AC:X87845] [GN:orf] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:B. subtilis ORF120, ORF160, tkt and ccdA genes . ] [SP:P45708] [LE:1011] [RE:1229] 
[DI:direct] >gp : [GI :ell83450 : g2 6 34 17 5] [LN: BSUB0010] [AC : Z99113 : AL009126] [GN:yneF] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis complete 

genome (section 10 of 21): from 1781201to 2014980.] [NT : alternate gene name: yoxG] 
[SP:P45708] [LE:140604] [RE:140822] [DI:direct] 

NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength c 



AI7503000983_1485008^_i:l_18 | [419 | [ 4191 |pTT~| | 310 | | S80 || 4.2e-88 



Description 

Sp : [LN : YQGK_BACSU] [AC:P46342] [GN: YQGK] [OR:BACILLUS SUBTILIS] [DE : INTERGENIC REGION 
(ORF75)] [SP:P46342] [DB : swissprot] >pir : [LN: E69956] [AC:E69956 ] [PN: phosphate ABC 
transporter (ATP-binding pro) homolog yqgK] [GN:yqgK ] [CL: inner membrane protein 
malK:ATP-binding cassette homology] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl013193:gl303858] [LN : BACJH642] [AC : D84432 : D823 70] [PN: YqgK] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] [DB : genpept -bctl] 
[DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 141613] 
[RE:142395] [DI:direct] >gp : [GI :dl010228 :g903307] [LN : BACPST] [AC:D58414] [PN:ORF75] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 pheAl) ) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis DNA for homologues of the E. coli pst 
geneproducts.] [NT: Similality to phosphate transport protein (PstB) of] [LE:3782] 
[RE:4564] [DI: direct] >gp : [GI :ell85762:g2 634928] [LN : BSUB0013] [AC : Z99116 : AL009126] 
[GN:yqgK] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 13 of 21): from 2395261to 2613730.] [NT : alternate gene name: 
yzmF; similar to phosphate ABC] [SP:P46342] [LE: 181226] [RE: 182008] [DI : complement] 



174 



ORF Name 



NTID 



NT 



AA 



AAID _ L , „ 



Length Length 



score probability 



IAI7503000983 14BVfcbb3 c2 919 



] | 420 1 | 4192 H 9 ^ 6 IP 31 IP 70 P-9e-S5 



Description 

gp: [GI:gll47744] [LN : EHU42211] [AC:U42211] [PN:PBP 5 synthesis repressor] [GN:psr] 
[FN: involved in the regulation of penicillin] [OR : Enterococcus hirae] [DB :genpept-bct2] 
[DE : Enterococcus hirae PBP 5 synthesis repressor (psr) gene, completecds . ] [LE:746] 
[RE: 1627] [DI:direct] 



ORF Name 



NTID AAID 



Ar7503000983_1490i5i2_c3_i0i2 



wit 



NT AA 
LengthLength 



score 



[TFT 



probability 
7.7e-48 



Description 

sp: [LN:NUC_STAIN] [AC:P43269] [GN : NUCI : NUC] [OR : STAPHYLOCOCCUS INTERMEDIUS] 
[EC:3. 1.31.1] [DE: (STAPHYLOCOCCAL NUCLEASE)] [SP:P43269] [DB : swissprot] 
>pir : [LN:S26079] [AC:S26079 ] [PN : thermonuclease] [CL :micrococcal nuclease] 
[OR staphylococcus intermedius] [DB:pir2] >gp : [GI :g47146] [LN:SINUC8] [AC:X67678] 
[PN: thermonuclease] [GN:nuc] [OR : Staphylococcus intermedius] [DB : genpept-bctl] 
[DE : S . intermedius nuc gene for thermonuclease.] [SP:P43269] [LE:495] [RE: 1001] 
[DI: direct] 



ORF Name 



LL7503000983 1508482& c3 1034 



NT 

NTID AAID — 



AA 



LengthLength 



score 



][ 



[4194 [[TOS 1 



probability 
3.0e-23 



Description 

sp: [LN:CCDC_BACSU] [AC:P45710] [GN:CCDC] [OR:BACILLUS SUBTILIS] [DE:CCDC PROTEIN] 
[SP:P45710] [DB: swissprot] >gp : [GI :g870927] [LN: BSTKTCCDA] [AC:X87845] [GN:ORF160] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis ORF120, ORF160, tkt and ccdA 
genes.] [SP:P45710] [LE:2942] [RE:3424] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



IAI7503000983 15757712 tl 23 



] |423 | [4195 ||732 1 |243 [pS - ||1.5e-6 



probability 





Description 

gp: [GI:g3800828] [LN : AF076684] [AC : AF076684] [PN : oligopeptide transporter putative 
ATPase domain] [GN:opp-2F] [OR : Staphylococcus aureus] [DB : genpept -bct2 ] 
[DE: Staphylococcus aureus oligopeptide transporter putative membranepermease domain 
<opp-2B) , oligopeptide transporter putativemembrane permease domain (opp-2C) , 
oligopeptide transporterputative ATPase domain (opp-2D) , and oligopeptide 
transporterputative ATPase domain (opp-2F) genes, complete cds . ] [LE:2735] [RE: 3436] 
[DI : direct] 



ORF Name 



NTID AAID J~ t , T _ ~, ,, score 
Length Length 



probability 



^17503000983 15803510 c2 854 



Description 
NO-HIT 



] |424 | [4195 | p^— 1| 43 | 



ORF Name 



NTID 



7V7VTr , NT AA 

AAID — — score 

LengthLength 



probability 



AI7503000983_16205378_ci_8i5 

Description 

NO-HIT 



|4i97 |p^~~| pr 



175 



ORF Name 



NT ID 



NT AA 
AAID — , — t , score 
Length Length 



1AI 7 503000983 162843 Y2 330 



] |426 | |419B ||458 1 1155 ||395 | [ 



probability 
l.Oe-36 



Description 

sp: [LN:LYSP_ECOLI] [AC:P25737] [GN : LYSP : CADR] [OR : ESCHERICHIA COLI] [DE : LYSINE-SPECIFIC 
PERMEASE] [SP : P25737] [DB : swissprot] 



ORF Name 



NT ID 



„ T „ NT AA 

AAID — . — x , score 

LengthLength 



IAI7503000983 16835388 EI 19 



] | 427 1 | 41 ^ |pT^|pi7 [Im- 



probability 
6.2e-23 



Description 

gp: [GI :dl023735 :g2547082] [LN:D89963] [AC:D89963] [PN:negative regulatory protein of 
pho regulon] [GN:phoU] [OR : Enterobacter cloacae] [SR : Enterobacter cloacae 

(strain: IFO3320) DNA] [DB : genpept-bctl] [DE : Enterobacter cloacae pstS, pstC, pstA, pstB 
and phoU genes , complete cds . ] [NT: The phoU gene is required for chemotaxis to ward] 

[LE:4554] [RE:5279] [DI:direct] 



ORF Name 



1AI 7 503000983 17040911 cl 77 1 



Description 
NO-HIT 



NT AA 
NTID AAID r — — , , score 
LengthLength 



probability 



] |428 | [4200 |pi~~[ p" 



ORF Name 



LAI7503000983 190707 tl 81 



Description 
NO-HIT 



NTID 



TIT 



NT AA 
AAID — — _ score 
LengthLength 



probability 



TIUT 



TIT 



TT 



ORF Name 



AI7503000983 194431 C3 1033 



Description 
NO-HIT 



_ ___ _ „ ra NT AA 

NTID AAID — _ — , , score 

LengthLength 



][ 



¥3"0~ 



probability 



[4202 |[P2 | |3T 



ORF Name 



« __ _ _ _ _ NT AA 

NTID AAID — , . — . . score 
LengthLength 



probability 



IAI7503000983 19540678 cl 797 



] |431 | [ 4203 |prjl5 — | | 14b [ p7Q— || 7.7e-23 



Description 

gp: [GI:g4835822] [LN : AF102174 ] [AC : AF102174] [PN:glycine betaine transporter BetL] 
[GN:betL] [OR:Listeria monocytogenes] [DB :genpept-bct2] [DE:Listeria monocytogenes 

glycine betaine transporter BetL (betL)gene, complete cds.] [LE:209] [RE: 1732] 
[DI : direct] 



ORF Name 



NTID 



7V7VTr , NT AA 

AAID — — score 

LengthLength 



[&I7503000983 19730438 c2 926 



] | 432 | |4204 ||127J]|423 [p?T 



probability 
3.3e-230 



Description 

pir: [LN:JC5326] [AC:JC5326 ] [PN : methicillin resistance factor FEMB] [GN:femB ] 
[CL:methicillin resistance factor femA] [OR : Staphylococcus epidermidis] [DB:pir2] 
>gp: [GI:gl815620] [LN : SEU23714] [AC:U23714] [PN : FEMB] [GN:femB] [OR : Staphylococcus 
epidermidis] [DB : genpept-bctl] [DE : Staphylococcus epidermidis factor essential for 
methicillinresistance FEMB (femB) gene, complete cds.] [NT: Factor essential for 
methicillin resistance] [LE:33] [RE: 1286] [DI:direct] 



176 



ORF Name 



NT AA 

^ AAI2 Le ^thLe^th ^^ probability 



117503000983 19735987 cl 7 54 



] [433 | [4205 ||1581||526 [1 1924 1 1 9.76-199 



Description 

pir: [LN:D69884] [AC:D69884 ] [PN : conserved hypothetical protein ymcB] [GNrymcB ] 
[CL: conserved hypothetical protein b0835] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI :ell85292 :g2634073] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:ymcB] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to hypothetical proteins] [LE:173791] 
[RE: 175320] [DI :direct] 



ORF Name 



NTID 



„ TT ^ NT AA 

AAID — , — . , score 

LengthLength 



probability 



IAI7503000983 19953281 c3 1015 



][ 



[4206 |[r?T7~[[533 [[822 ||5.8e-82 



Description 

sp: [LN:DHOM_BACSU] [AC:P19582] [GN :HOM: TDM] [OR:BACILLUS SUBTILIS] [EC:1.1.1.3] 
[DE : HOMOSERINE DEHYDROGENASE, (HDH) ] [SP:P19582] [DB : swissprot ] >gp : [GI : g5584 94 ] 
[LN: BACHOM] [AC :M23217 : J04034] [PN : homoserine dehydrogenase] [GN:hom] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis DNA] [DB : genpept-bctl] [EC:1. 1.1.3] [DE:Bacillus 
subtilis homoserine dehydrogenase (horn) gene, completecds, threonine synthase (thrC) 
gene, 5' end of cds.] [LE:2 76] [RE: 1577] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



117503000983 20085512 c2 899 



][ 



[4207 |[554 [ 



TTT 



probability 
|i.Se-77 



Description 

gp: [GI:g3064126] [LN : AF011784 ] [AC : AF011784] [PN: catalase] [GN:katA] [OR:Vibrio 
fischeri] [DB : genpept -bct2 ] [DE: Vibrio fischeri catalase (katA) gene, complete cds.] 
[LE:256] [RE:1704] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ T — . , score 
LengthLength 



probability 



IAI7503000983 20095803 c3 994 



~J [436 | [4208 |pT7 | [58 i 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID AAID . — , . 



LengthLength 



score 



117503000983 20312510 t3 505 



][ 



[4209 |pT3T~j|345 ||1070 [ [: 



probability 
3.ie-108 



Description 

gp: [GI:g3800825] [LN : AF076684] [AC : AF076684] [PN : oligopeptide transporter putative 
membrane] [GN:opp-2B] [OR : Staphylococcus aureus] [DB : genpept-bct2] [DE : Staphylococcus 
aureus oligopeptide transporter putative membranepermease domain (opp-2B) , oligopeptide 
transporter putativemembrane permease domain (opp-2C) , oligopeptide transporterputative 
ATPase domain (opp-2D) , and oligopeptide transporterputative ATPase domain (opp-2F) 
genes, complete cds.] [LE:173] [RE: 1159] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ — , - score 
LengthLength 



probability 



AI7503000383_20347iSS_t3_48S 

Description 

NO-HIT 



^ [438 | [4210 ||147 | [48 ~| 



177 



ORF Name 



NT ID 



NT AA 
AAID — ■m _ — . , score 
Length Length 



probability 



^17503000983 20370457 c2 894 



] | 439 | | 4211 | P | 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID . . , . — 



LengthLength 



score 



1AI7503000983 20400051 t2 319 



[4212 |pT0~[|259 ||69i | 



probability 
4.4e-S8 



Description 

sp: [LN:LEXA_BACSU] [AC:P31080] [GN : LEXA : DINR] [OR:BACILLUS SUBTILIS] [DE : SOS REGULATORY 
PROTEIN LEXA/DINR] [SP:P31080] [DB : swissprot] >pir : [LN : A41315] [AC : A41315 : B6 9651 ] 
[PN: transcription repressor of SOS regulon lexA/dinR] [GN : lexA : dinR ] [CL:lexA protein] 
[OR : Bacillus subtilis] [DB:pir2] >gp : [GI :g289288] [LN : BACLEXA] [AC:M64684] [GN : lexA] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis DNA] [DB : genpept-bctl] [DE:Bacillus 
subtilis lexA gene, 3' end.] [LE:390] [RE:1007] [DI:direct] >gp : [GI : ell83444 : g2634169] 
[LN : BSUB0010] [AC : Z99113 : AL009126 ] [PN : transcriptional regulator] [GN : lexA] 
[FN: negative regulation of the SOS regulon] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 10 of 21): from 1781201to 2014980.] 
[NT alternate gene name: dinR] [SP:P31080] [LE: 135694] [RE: 136311] [D I : complement] 



ORF Name 



NTID 



NT AA 
AAID — — _ score 
LengthLength 



probability 



IAI7503000983 20423127 c3 991 



][ 



[4213 |p^7T]|891 ||2302 ||8.5e-239 



Description 

pir: [LN:C69663] [AC:C69663 ] [ PN : DNA mismatch repair (recognition) mutS] [GN:mutS ] 
[CL : DNA mismatch repair protein mutS] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85295:g2634076] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:mutS] [ FN : DNA 
mismatch repair recognition] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus 
subtilis complete genome (section 9 of 21): from 1598421to 1807200.] [SP:P49849] 
[LE: 176693] [RE: 179269] [DI : direct] 



ORF Name 



NTID AAID — ^ — , score 
LengthLength 



probability 



IAI7503000983 20485875 c 3 1008 



j |442 | [4214 ||231~]|75 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID 

Length Length 



score 



117503000983 20501250 c2 922 



] |443 | [4215 |[TuTT||342 |[F5T 



probability 
3.0e-53 



j 



Description 

Sp: [LN : TRPD_MET JA] [AC:Q57686] [GN : TRPD : MJ0234] [OR : METHANO COCCUS JANNASCHII] 
[EC:2.4.2.18] [DE : ANTHRANILATE PHOSPHOR I BOS YLTRANSFERASE , ] [SP:Q57686] [DB : swissprot] 
>pir: [LN:C64329] [AC:C64329 ] [PN : anthranilate synthase, subunit II'] [CL : anthranilate 
phosphoribosyltransf erase : trpD homology] [OR : Methanococcus jannaschii] [EC : 4 . 1 . 3 . 27] 
[DB:pir2] [MP:FOR225111-226121 ] >gp : [GI : gl590969] [LN:U67479] [AC : U67479 : L77117] 
[PN: anthranilate synthase component II (trpD)] [GN:MJ0234] [OR : Methanococcus 
jannaschii] [DB :genpept-bct2] [DE : Methanococcus jannaschii section 21 of 150 of the 
complete genome.] [NT: similar to GB:M33814 SP:P20574 PID: 151234 percent] [LE:3662] 
[RE:4672] [DI:direct] 



178 



NT AA 

ORF Name NTID AAID Le ^ t frLenqth score probability 



AlVS03000SBJJ*0b04bl2_c2_664 | [444 | [ 4216 || ±V!> H"| |585 | p^~| |i . 9e-80 



Description 

gp: [GI:g4981713] [LN : AE001773] [AC : AE001773 : AE000512] [PN : 2 -oxoacid ferredoxin 
oxidoreductase, alpha] [GN:TM1164] [OR : Thermo toga maritima] [DB : genpept -bct2] 
[DE:Thermotoga maritima section 85 of 136 of the complete genome.] [NT: similar to 
GB:AE000666 percent identity: 68.99;] [LE:8189] [RE:9865] [DI:direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|&I7503000983_205327_c2_S3g | [ 445 | [ 4217 |p?T] [443 | p7T|| 8 . 9e-i73 

Description 

sp: [LN:GID_BACSU] [AC:P39815] [GN:GID] [OR:BACILLUS SUBTILIS] [DE : GID PROTEIN] 
[SP:P39815] [DB : swissprot] >pir: [LN:A69632] [AC : A69632 : S61497 ] [PN :glucose-inhibited 
division protein gid] [GN:gid ] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell85204 :g2633985] [LN : BSUB0009] [AC:Z99112 :AL009126] [PN : glucose- inhibited 
division protein] [GN:gid] [OR:Bacillus subtilis] [DB : genpept -bet 1 ] [DE:Bacillus 
subtilis complete genome (section 9 of 21): from 1598421to 1807200.] [NT : alternate gene 
name: ylyC] [LE:86791] [RE:88098] [DI: direct] >gp : [GI : e332181 :g2462971] [LN : BSYLQGCOD] 
[AC:AJ000975] [PN:Gid protein] [GN:gid] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis ylqg to codV gene region.] [SP:P39815] [LE:5959] [RE:7266] 
[DI :direct] 

NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength *- 



|Ar/b03000983_2074B462_cl_767 | |446 | | 42i8 || i257 "| [ 418 | p2~[| J . 4e-86 "j 

Description 

pir : [LN:B69888] [AC:B69888 ] [PN : GTP -binding protein proteinase modulator homolog ynbA] 
[GN:ynbA ] [CL :GTP-binding protein hflX] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:gl750108] [LN : BSU66480] [AC:U66480] [PN: YnbA] [GN:ynbA] [OR: Bacillus subtilis] 
[DB: genpept -bet 1] [DE:Bacillus subtilis SpoVK (spoVK) , YnbA (ynbA), YnbB (ynbB) , 
GlnR(glnR), glutamine synthetase (glnA) , YnaA (ynaA) , YnaB (ynaB) , YnaC(ynaC), YnaD 
(ynaD), YnaE (ynaE) , YnaF (ynaF), YnaG (ynaG), YnaH(ynaH), Ynal (ynal) , YnaJ (ynaJ) , 
xylan beta- 1 , 4-xylosidase (xynB) , xylose repressor (xylR) , xylose isomerase (xylA) , 
xylulose kinase (xylB) , YncB (yncB) , YncC (yncC) , YncD (yncD) and YncE (yncE)genes, 
complete cds . ] [LE:1382] [RE:2386] [DI:direct] >gp : [GI : ell83402 :g2634127] [LN : BSUB0010] 
[AC:Z99113 :AL009126] [GN:ynbA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 10 of 21): from 1781201to 2014980.] 
[NT: similar to GTP-binding protein protease modulator] [LE: 93433] [RE: 9443 7] 
[DI:direct] 

NT AA 

ORF Name NTID AAID — «^ — , , score probability 
LengthLength c ^ 



AI7S03000983_20819^12^il_50 | | 447 | [ 4215 [p31 [ pTT 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — , , score probability 

Lengtn Length 



AI7S03000383_205ga78aj:3_S79 | |448 | [ 4220 || 18u ^p~ 



Description 
NO-HIT 



179 



ORF Name 



NTID 



AAID 



[AI7503000983 21641477 t2 250 



14221 



NT AA 
LengthLength 
954 IPT7- 



score probability 
687 ||1.2e-67 



Description 

pir: [LN:E69419] [AC:E69419 ] [PN : phosphate ABC transporter, permease protein (pstA) 

homolog] [CL:phoW protein] [OR : Archaeoglobus fulgidus] [DB:pir2] >gp : [GI :g2649217] 
[LN:AE001010] [AC : AE001010 : AE000782] [PN : phosphate ABC transporter, permease protein] 
[GN:AF1358] [OR : Archaeoglobus fulgidus] [DB : genpept -bct2 ] [DE : Archaeoglobus fulgidus 

section 97 of 172 of the complete genome.] [NT: similar to GB:L10328 SP:P07654 GB:K01992 

GB:X02723] [LE:6184] [RE:7035] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — _ — , , score 
LengthLength 



1AI7503000983 21751938 t3 601 



[4222 j|708 | pTT 



TFT" 



probability 
3.0e-14 



Description 

gp: [GI:g2897104] [LN: AF020798] [AC : AF02 0798 ] [PN:putative host cell surface-exposed 
lipoprotein] [OR : Streptococcus thermophilus bacteriophage TP-J34] [DB : genpept -phg] 

[DE: Streptococcus thermophilus bacteriophage lysogeny module, integrasehomolog (int) , 
putative host cell surface-exposed lipoprotein, putative metallo-proteinase , repressor, 
Cro-like regulatoryprotein, and Pl-antirepressor homolog genes, complete cds.] 

[NT:orfl42] [LE:3941] [RE:4369] [DI : complement] 



ORF Name 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



V17503000983 21912535 tl 212 



4223 



75" 



110.018 



] 



Description 

pir: [LN:B21124] [AC:B21124 ] [PN:Bkm-like sex- determining region hypothetical protein 
CS314] [GN: Bkm ] [OR : Drosophila melanogaster] [DB:pir2] 



ORF Name 



NT AA 

NTID AAID — — . score proba bility 
LengthLength L - 



IAT750300"0983 2195255 tl 6 



T52" 



Description 
NO-HIT 



|147 |p 1 



ORF Name 



NT AA 

NTID AAID — — , , score probabi lity 
LengthLength 



1*7503000983 22304535 tl 113 



Description 
NO-HIT 



] |453 | [4225 |p3u~||49 | 



ORF Name 



NTID 



NT 



AA 



AAID — _ — . 
LengthLength - 



IAI7503000983 22368803 c2 857 



j |454 | [4226 |p7yri|798 ||1659 



probability 
1.2e-170 



Description 

sp: [LN:SP3E_BACSU] [AC : P21458 : P21459] [GN:SPOIIIE] [OR:BACILLUS SUBTILIS] [DE : STAGE III 
SPORULATION PROTEIN E] [SP : P21458 : P21459] [DB : swissprot] >pir : [LN : S09411] 
[AC:S09411:A32269:B32269:F69712 ] [ PN : DNA translocase spoIIIE] [GN:spoIIIE ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85271 :g2634052] [LN : BSUB0009] 
[AC:Z99112 :AL009126] [ PN : DNA translocase] [GN:spoIIIE] [FN:required for chromosome 
partitioning through] [OR: Bacillus subtilis] [DB : genpept -bctl] [DE: Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [SP:P21458] [LE:153248] 
[RE:155611] [DI :direct] 



180 



ORF Name 



F 



17503000983 22390917 £3 560 



Description 
NO-HIT 



NT AA 
NT ID AAID — _ T — , . score 
LengthLength 



probability 



] | 455 | I 4227 ||5T" 



ORF Name 



NT 



AA 



NT ID AAID — . T — . 
LengthLength 



score 



1A17503000983 22444075 c3 959 



J [4228 ||371~|pW 



755" 



probability 
|2.3e-78 



Description 

gp: [GI:dl034831:g3767595] [LN: ABO 15 195] [AC : AB015195] [OR : Staphylococcus aureus] 
[SR: Staphylococcus aureus (strain:RN450) DNA] [DB :genpept-bctl] [DE : Staphylococcus 

aureus gene for LytN and Eprh, complete cds . ] [NT : 0RF4] [LE:3221] [RE:4024] [DI:direct] 

>gp: [GI:dl034831:g3767595] [LN : AB015195] [AC : AB015195] [OR : Staphylococcus aureus] 
[SR: Staphylococcus aureus (strain : RN4 50 ) DNA] [DBrgenpept] [DE : Staphylococcus aureus 

gene for LytN and Eprh, complete cds.] [NT : 0RF4 ] [LE:3221] [RE:4024] [DI:direct] 



ORF Name 



NT 

NT ID AAID _ — 



AA 



IAX7503000983 22453425 tl 195 



Description 
NO-HIT 



LengthLength 
] |457 | [4229 ||138 ~] |4S | 



score probability 



ORF Name 



NT ID AAID 



NT 



AA 



LengthLength 



score 



IAI7503000983 22470463 c3 989 



£55" 



"| [4230 ||810~[|259 |pT" 



probability 
|3.3e-88 



Description 

pir: [LN:G69884] [AC:G69884 ] [PN : conserved hypothetical protein ymdB] [GNrymdB ] 
[CL: hypothetical protein ymdB] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell85288:g2634069] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:ymdB] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to hypothetical proteins] [LE:169889] 
[RE: 170683] [DI :direct] 



ORF Name 



NT AA 
NTID AAID — — , . score 
LengthLength 



probability 



E 



17503000983 22539812 ci 758 



] I 455 I I 4231 |P^]|4i7 |pSTT||l.Se-i3i 



Description 

pir: [LN:C69888] [AC:C69888 ] [PN : conserved hypothetical protein ynbB] [GN:ynbB ] 
[CL: hypothetical protein ynbB] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl750109] 
[LN:BSU66480] [AC:U66480] [PN : YnbB] [GN:ynbB] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis SpoVK (spoVK) , YnbA (ynbA) , YnbB (ynbB), GlnR(glnR), glutamine 

synthetase (glnA) , YnaA (ynaA) , YnaB (ynaB), YnaC(ynaC), YnaD (ynaD) , YnaE (ynaE), YnaF 
(ynaF) , YnaG (ynaG) , YnaH(ynaH), Ynal (ynal) , YnaJ (ynaJ) , xylan beta-1, 4-xylosidase 
(xynB) , xylose repressor (xylR) , xylose isomerase (xylA) , xylulose kinase (xylB) , YncB 
(yncB) , YncC (yncC) , YncD (yncD) and YncE (yncE)genes, complete cds.] [LE:2574] 
[RE:3839] [DI:direct] >gp : [GI :ell83403:g2 634128] [LN:BSUB0010] [AC : Z99113 : AL009126] 
[GN:ynbB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 

complete genome (section 10 of 21): from 1781201to 2014980.] [NT:similar to 

hypothetical proteins] [LE: 94625] [RE: 95890] [DI: direct] 



181 



NT AA 

ORFName NTID AAID Le ~ thLe ~ th gggre probability 



AI75030009aJJ^<l>.j932_tl_22 | |450 | [ 4232 | [g72~~ ]| 323 [1 10211 1 4.86-103 



Description 

gp: [GI :g3800826] [LN: AF076684] [AC : AF076684] [PN : oligopeptide transporter putative 
membrane] [GN:opp-2C] [OR: Staphylococcus aureus] [DB :genpept-bct2] [DE : Staphylococcus 
aureus oligopeptide transporter putative membranepermease domain (opp-2B) , oligopeptide 
transporter putativemembrane permease domain (opp-2C) , oligopeptide transporterputative 
ATPase domain (opp-2D) , and oligopeptide transporterputative ATPase domain (opp-2F) 
genes, complete cds . ] [LE:1152] [RE: 1982] [DI: direct] 



„ NT AA 
QRF Name NTID AAID — ^_ — scor e probability 
Length Length c *- 



[M7503000983_22fl50fl85_ci_752 | \U1 J [4233 |p5T| |53i | |T75T]| i . 7e-iBb 

Description 

pir: [LN:F69884] [AC:F69884 ] [PN : conserved hypothetical protein ymdA] [GN:ymdA ] 
[CL: hypothetical protein ymdA] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI : ell85287 : g2634068] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:ymdA] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpep t - be 1 1 ] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to hypothetical proteins] [LE:168258] 
[RE: 169820] [DI :direct] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



I75030009S3_234454S3_c3_9S5 1 |2S2 1 [4234 |pn~| |255 | [1002 ||4 . 9e-101 



Description 

pir: [LN:A69699] [AC : A69699 : SO 9 561 ] [PN : ribosomal protein S2 (rpsB) : ribosomal protein 
BS1] [GN : rpsB ] [CL : Escherichia coli ribosomal protein S2] [OR:Bacillus subtilis] 

[DB:pir2] >gp: [GI : ell8 52 40 :g2 634021] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN : ribosomal 
protein S2] [GN: rpsB] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [SP:P21464] [LE:118905] 

[RE : 119645] [DI :direct] 



NT AA 

ORF Name NTID AAID — , — score probability 
LengthLength 



AI7503000983_23472175_c3_975 | |453 | [4235 |p^~[ | 138 | p^r"|| b . 3e-33 



Description 

Sp : [LN : RBFA_BACSU] [AC:P32731] [GN :RBFA] [OR:BACILLUS SUBTILIS] [DE : RIBOSOME- BINDING 

FACTOR A (P15B PROTEIN)] [SP:P32731] [DB : swissprot] >pir : [LN : G36905] 

[AC:G36905:G69689:S31996 ] [PN : ribosome- binding factor A rbfA] [GN:rbfA ] 

[CL: Escherichia coli protein P15B] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g580902] 

[LN : BSORF1T7A] [AC:Z18631] [GN : ORF6] [OR:Bacillus subtilis] [DB : genpept-bctl] 

[DE:B. subtilis infB-nusA operon.] [SP:P32731] [LE:5131] [RE:5484] [DI:direct] 

>gp: [GI:ell85256 :g2634037] [LN : BSUB0009] [AC : 299112 :AL009126] [PN : ribosome -binding 

factor A] [GN:rbfA] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 

complete genome (section 9 of 21) : from 1598421to 1807200.] [NT : alternate gene name: 

ymxE, ylxO] [SP:P32731] [LE:137422] [RE:137775] [DI: direct] 



ORF Name 



NTID 



AAID 



IAT7503000993 23491077 cl 824 



NT AA 
LengthLength 
T2T^ 



score 



probability 



TIT 



11037 ||9.6e-105 



Description 

Sp: [LN :DCDA_BACSU] [AC:P23630] [GN : LYSA : LYS] [OR:BACILLUS SUBTILIS] [EC : 4 . 1 . 1 . 20] 
[DE : DIAMINOPIMELATE DECARBOXYLASE, (DAP DECARBOXYLASE)] [SP:P23630] [DB : swissprot] 
>gp: [GI:dl013342 :gl304007] [LN : BACJH642] [AC : D84432 :D82370] [PN : LysA] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE:281375] 
[RE: 282700] [DI: direct] 



182 



ORF Name 



NT AA 
NTID AAID — ■» — ^, score 
LengthLength 



11 7 503000983 23554760 t3 646 



] | 465 1 | 4237 1F^~| 



probability 
10.0077 



Description 

gp: [GI:gll23040] [LN: CELF44A2] [AC:U41993] [GN:F44A2.4] [OR : Caenorhabditis elegans] 
[SR: Caenorhabditis elegans strain=Bristol N2] [DB : genpep t - invl ] [DE : Caenorhabditis 
elegans cosmid F44A2 . ] [NT: similar to C4-type zinc finger and to C. elegans] 
[LE: 11552: 11678: 12072: 12484] [RE : 11632 : 12022 : 12435 : 12848] [DI : direct Join] 



ORF Name 



NTID 



^ m NT AA 

AAID — _ — . score 

LengthLength 



117503000383 23557807 £2 357 



T2T5" 



TTT 



probability 
1.8e-48 



Description 

gp: [GI:ell72770:g2598550] [LN:LLAJ109] [AC : AJ000109] [PN :gluthatione peroxidase] 
[GN:gpo] [OR:Lactococcus lactis] [DB : genpept-bctl] [DE : Lactococcus lactis carB and gpo 
genes.] [LE:163] [RE: 636] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — — . , score 
LengthLength 



probability 



Ai7503 000983_23597802_c2_942 

Description 

NO-HIT 



] |467 | [4239 ||144~||47 | 



ORF Name 



NTID AAID . 



IAI7503000983 23632758 cl 723 



4240 



NT AA 
LengthLength 



score 



probability 



WOT 



IT 



110.037 



Description 

gp: [GI:g882139] [LN : SCU17174] [AC:U17174] [PN:phase-2 flagellin structural protein] 
[GNifljB] [OR: Salmonella choleraesuis] [DB : genpept - be t 2 ] [DE : Salmonella choleraesuis 

ATCC 6967 phase-2 flagellin structuralprotein (fljB) gene, complete cds . ] [LE:1] 
[RE: 1521] [DI: direct] 



ORF Name 



NTID AAID 



NT 



AA 



Length Length 



LAI7503000983 23633467 c2 851 



[4241 |328 



score probability 
i.ie-64 



Description 

pir: [LN:D69692] [AC:D69692 ] [PN : riboflavin kinase / FAD synthase ribC] [GN:ribC ] 
[CL:conserved hypothetical protein HI0963] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:e269877:gl592690] [LN : BSRIBRPS] [AC:Z80835] [ PN : FMN adenylyltransf erase] 
[GN:ribC] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis ribC, rpsO and pnpA 
genes.] [NT : riboflavin kinase] [SP:P54575] [LE:88] [RE: 1038] [DI: direct] 
>gp: [GI:ell85258:g2634039] [LN : BSUB0009] [AC : Z99112 : AL009126] [ PN : FAD synthase] 
[GN:ribC] [FN : riboflavin biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[EC:2.7.1.26:2.7.7.2] [DE:Bacillus subtilis complete genome (section 9 of 21): from 
1598421to 1807200.] [NT : riboflavin kinase] [SP:P54575] [LE:138805] [RE:139755] 
[DI: direct] 



183 



NT AA , , . . 

ORF Name NTIP AAID Le ^ t ^ Le ^ t ^ score probability 



[A1 7 50300098a^Jfei>02bO_cl_729 | | 470 | [ 4242 | | 546 1 1181 | j| l . 6e-55 

Description 

sp: [LN:HSLV_BACSU] [AC:P39070] [GN : HSLV: CLPQ : CODW] [OR : BACILLUS SUBTILIS] [EC : 3 . 4 . 99 . -] 
[DE:HEAT SHOCK PROTEIN HSLV PRECURSOR,] [SP:P39070] [DB : swissprot] >pir: [LN:S61494] 
[AC:S61494 :S45024 :C69601 ] [PN:20S proteasome beta-type chain clpQrheat shock protein 
codW] [GN:clpQ:codW ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : g535349] [LN : BSU13634] 
[AC:U13634] [PNrCodW] [GNicodW] [ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus 
subtilis JH642 dipeptide permease operon regulators, codV,codW, codX, and codY genes, 
complete cds . ] [LE:1220] [RE:1765] [DIrdirect] >gp : [GI : ell85206 :g2633987] [LN : BSUB0009] 
[AC:Z99112:AL009126] [PN : beta- type subunit of the 20S proteasome] [GNiclpQ] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT : alternate gene name: hslV, codW] [SP:P39070] 
[LE:89093] [RE:89638] [DI: direct] 



NT AA 

ORF Name NTIP AAID — „ — , score probability 

LengtnLengtn 

AI7503000983_23650343_c2_943 | |47i | [4243 |[XTST] |388 | Jgrr~||i. ie-BO 



Description 

pir : [LN:G71097] [AC:G71097 ] [PN:probable amidohydrolase] [GN:PH1043 ] [OR : Pyrococcus 
horikoshii] [DB:pir2] >gp: [GI : dl031084 :g3257458] [LN: AP000004] 

[AC:AP000004 :AB0094 94 :AB009495 :AB009496 : AB0094 97 : AB009498 :AB009499] [PN:387aa long 
hypothetical amidohydrolase] [GN:PH1043] [OR : Pyrococcus horikoshii] [SR : Pyrococcus 
horikoshii (strain :OT3) DNA] [DB :genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic 
DNA, 777001-994000 nt . position (4/7 ). ] [NT:similar to Swiss_Prot : P80092 percent 
identity:] [LE:172136] [RE:173299] [DI : complement] 

NT AA 

ORF Name NTIP AAID — — , , score probabi lity 
LengtnLengtn 



[AI7503000983_23671890_cl_807 | |472 | [ 4244 | pgT] | 428 || 518 || 3.9e-92 | 

Pescription 

pir: [LN:A69730] [AC:A69730 ] [PN : UV-damage repair protein uvrX] [GN:uvrX ] [CL:umuC 
protein] [OR:Bacillus subtilis] [PB:pir2] >gp : [GI : ell83597 :g2634570] [LN : BSUB0012] 
[AC:Z99115:AL009126] [PN:UV-damage repair protein] [GN:uvrX] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 12 of 21) : from 
2195541to 2409220.] [NT : alternate gene name: yolE] [LE: 74108] [RE: 75358] 

[DI: complement] >gp : [GI :g3025495] [LN: AF020713] [AC : AF020713] [PN : IMPB/MUCB/SAMB family 
protein] [GN:yolE] [OR : Bacteriophage SPBc2] [DB : genpept -phg] [DE : Bacteriophage SPBc2 
complete genome.] [LE: 14792] [RE: 16042] [DI:direct] 

NT AA 

ORF Name NTIP AAIP — — , , score probabil ity 
Length Length L - 

[4245 ||TTTuT"| piT" 



(Al7b03000983 23673150 c3 1029 



I.le-135 



Pescription 

pir: [LN:C70015] [AC:C70015 ] [PN:GMP reductase, homolog yumP] [GN:yumP ] [OR:Bacillus 
subtilis] [EC:1.6.6.8] [DB:pir2] >gp : [GI :ell84292:g2 635710] [LN:BSUB0017] 
[AC:Z99120 :AL009126] [GN : yumD] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 17 of 21): from 3197001to 3414420.] 
[NT: similar to GMP reductase] [LE: 105099] [RE: 106079] [DI:direct] 

>gp: [GI:e311468:gl934831] [LN : BSZ93939] [AC:Z93939] [PN:unknown] [GN:yumD] [OR:Bacillus 
subtilis] [DB: genpept -bet 1] [DE :B . subtilis genomic DNA fragment from yumA to yulF.] 
[NT:potential inosine or guanosine 5' monophosphate] [LE:3536] [RE:4516] [DI:direct] 



184 



ORF Name 



NT 

NT ID AAID , 



AA 



LengthLength 



score probability 



IAI7503000983 23725442 F2 230 



] | 474 | | 4246 || 4S6 1| 151 IP 7 |P- 0028 



Description 

gp: [GI :g3582212] [LN : AE001272] [AC : AE0012 72] [PN : Potential membrane spanning protein] 
[GN:ORF00031] [OR : Lactococcus lactis] [DB :genpept-bct2] [DE : Lactococcus lactis DPC3147 
plasmid pMRCOl, complete plasmidsequence . ] [NT:similar to GB:Z30588 PID:459257 percent 
identity:] [LE: 25025] [RE: 25804] [DI : complement] 



ORF Name 



NT AA 

NTID AAID — , «^ _ — . , score probab ility 
LengthLength c L 



|A:r7503000983 23850952 t3 525 



4247 11515 



T7T~ 



1453 ||7.4e-43 



Description 

sp: [LN:PMSR_BACSU] [AC:P54154] [GN:YPPP] [OR:BACILLUS SUBTILIS] [DE : REDUCTASE) ] 
[SP:P54154] [DB : swissprot] >pir : [LN:E69940] [AC:E69940 ] [PN: peptide methionine 
sulfoxide reductase homolog yppP] [GN:yppP ] [CL: peptide methionine sulfoxide 
reductase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl256653] [LN : BACYACA] [AC:L77246] 
[PN:DNA-binding protein] [GN:yppP] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis (YAC10-9 clone) DNA region between the serA andkdg loci.] 
[NT:42.4% identity with the Lycopersicon esculentum] [LE:33389] [RE:33922] [DI:direct] 
>gp: [GI :ell83616 :g2 634589] [LN : BSUB0012 ] [AC : Z99115 : AL009126] [GN:yppP] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
12 of 21): from 2195541to 2409220.] [NT:similar to peptide methionine sulfoxide 
reductase] [SP:P54154] [LE: 91230] [RE: 91763] [D I : complement] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



LAI7503000983 23944052 cl 789 



] | 47 ^ | |4248 [|125 



probability 
i.7e-20 



Description 

sp : [LN : CATB_BACFR] [AC:P45737] [GN: KATB] [OR : BACTEROIDES FRAGILIS] [EC : 1 . 11 . 1 . 6] 
[DE:CATALASE # ] [SP:P45737] [DB : swissprot] >pir : [LN : A57262] [AC:A57262 ] [PN : catalase , ] 
[GN:katB ] [CL: catalase] [OR:Bacteroides fragilis] [EC : 1 . 11 . 1 . 6] [DB:pir2] 
>gp: [GI:g841192] [LN : BFU18676] [AC:U18676] [PN : catalase] [GN:katB] [FN : decomposes 
hydrogen peroxide in water and] [OR:Bacteroides fragilis] [DB :genpept-bct2] 
[EC: 1.11. 1.6] [DE:Bacteroides fragilis catalase (katB) gene, complete cds . ] [LE:348] 
[RE: 1808] [DI: direct] 



ORF Name 



[AI7503000963 23992812 cl 793 



Description 
NO-HIT 



NTID 



„ Tn NT AA 

AAID — — , score 

LengthLength 



probability 



[4249 |p^9~| 



ORF Name 



NT AA 
NTID AAID — — score 
LengthLength 



probability 



IAI7503000983 24017127 ci 759 



] |478 | [4250 |p3~||134 | pr" ||9 . 3e-52 



Description 

gp: [GI:g468509] [LN : SAGLNAR] [AC:X76490] [PN : glutamine synthetase repressor] [GN:glnR] 
[OR staphylococcus aureus] [DB :genpept-bctl] [DE:S. aureus (bb2 70) glnA and glnR genes.] 
[LE:975] [RE:1343] [DIrdirect] 



185 



ORF Name 



NT ID 



NT 

AAID _ 



AA 



LengthLength 



score 



IAI 7 5030009U3 24033217 c2 850 



probability 
] I 479 1 | 4251 |P°~~1P Q9 1P° ll j -^ e - 71 



Description 

sp: [LN:TRUB_BACSU] [AC:P32732] [GN: TRUB] [OR : BACILLUS SUBTILIS] [EC : 4 . 2 . 1 . 70] 
[DE : HYDROLYASE) ] [SP:P32732] [DB : swissprot] >pir : [LN : G6 9726] [AC :G69726 :H36905 : S31997 ] 
[PN: tRNA pseudouridine 5S synthase truB:P3 5 homolog] [GNrtruB ] [CL : Escherichia coli 
protein P35] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85257 : g2634038] [LN : BSUB0009] 
[AC : Z99112 : AL009126] [PN : tRNA pseudouridine 55 synthase] [GN : truB] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DErBacillus subtilis complete genome (section 9 of 21): 
from 1598421to 1807200.] [NT : alternate gene name: ylxN, ymxF] [SP:P32732] [LE:137857] 
[RE : 138786] [DI : direct] 



ORF Name 



NTID 



_ _ «— NT AA 

AAID — . T — score 

LengthLength 



AI7503000983 24078753 C3 999 



TOT 



[4252 |p?U~[p^ |F^" 



probability 
|4.1e-26 



Description 

pir: [LN:G69657] [AC:G69657 ] [PN : tRNA isopentenylpyrophosphate transferase miaA] 
[GN:miaA ] [CL : delta (2 ) -isopentenylpyrophosphate transferase] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : el 183392 :g2 634117] [LN:BSUB0010] [AC : 299113 : AL009126] [PN: tRNA 
isopentenylpyrophosphate transferase] [GN:miaA] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 10 of 21) : from 
1781201tO 2014980.] [LE:84430] [RE:85374] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — — , , score 
LengthLength 



probability 



[AI7503000983 24222137 F2 341 



Description 
NO-HIT 



1Pn F r 1 p 1F 1 



ORF Name 



NT AA 
NTID AAID — , — , score 
Length Length 



probability 



IAI7503000983 24225000 t3 475 



2 |482 | [4254 |[g5T"| |297 ||360 ||S.3e-33 



Description 

gp: [GI:g2194195] [LN : SGU61158] [AC:U61158] [PN: GdmF] [GN:gdmF] [OR : Staphylococcus 
gallinarum] [DB : genpept-bctl] [DE : Staphylococcus gallinarum Tue3928 GdmF (gdmF) , 
putative membraneprotein (gdmH) , ABC transporter (gdmT) , and antibiotic 
galliderminprecursor (gdmA) genes, complete cds, putative membrane protein (gdmE) and 
modifying enzyme (gdmB) genes, partial cds . ] [NT:proposed ABC transporter subunit 
(ATP-binding] [LE:179] [RE: 874] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



E 



17503000583 24225053 c3 1031 



][ 



[4255 |pT5 [ |105 |pT2~ 



probability 
2.9e-16 



Description 

pir: [LN:E69894] [AC:E69894 ] [PN : hypothetical protein ynzC] [GN:ynzC ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell83447:g2 634172] [LN : BSUB0010] [AC : Z99113 : AL009126] 
[GN:ynzC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 10 of 21): from 1781201to 2014980.] [LE:137514] [RE:137747] 
[DI : direct] 



ORF Name 



NTID 



NT 

AAID 



AA 



Length Length 



score 



^7503000983 2425146S cl 799 



][ 



[4256 Ip 7 ^] 



probability 
13437 110. 0 



Description 

sp: [LN:ACON_BACSU] [AC : P0933 9 : Q45059] [GN : CITB] [OR:BACILLUS SUBTILIS] [EC:4.2.1.3] 
[DE : ACONITATE HYDRATASE, (CITRATE HYDRO-LYASE) (ACONITASE) ] [SP : P0933 9 : Q45059] 
[DB: swissprot] 



186 



ORF Name 



NT AA 
NTID AAID — « _ — scor e 
Length Length 



probability 



IAI75030009UJ 24257252 cl 727 



Description 
NO-HIT 



ORF Name 



NTID 



NT 

AAID _ 



AA 



LengthLength 



score 



[AI7503000983 242&7941 ci 745 



j [486 | [4258 |[725~||24i | [? 



probability 
4.5e-27 



Description 

sp: [LN:YMFC_BACSU] [AC:031761] [GN:YMFC] [OR : BACILLUS SUBTILIS] [DE : REGION] [SP:031761] 
[DBrswissprot] >pir: [LN:B69885] [AC:B69885 ] [PN: transcription regulator GntR family 
homolog ymfC] [GN:ymfC ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85272 :g2634053] 
[LN:BSUB0009] [AC : Z99112 : AL009126 ] [GN:ymfC] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DErBacillus subtilis complete genome (section 9 of 21) : from 
1598421to 1807200.] [NT: similar to transcriptional regulator (GntR family)] [SP:031761] 
[LE:155754] [RE:156479] [DI rdirect] 



ORF Name 



NT 



AA 



NTID AAID , — 



LengthLength 



score 



E 



17503000983 24267942 cl 823 



j |487 | [4259 ||ruW]|362 j pOO | [ 



probability 
1.2e-2S 



Description 

sp : [LN : ALR_BACSU] [AC : P10725 : P96620] [GN: ALR :DAL] [OR : BACILLUS SUBTILIS] [EC:5. 1.1.1] 
[DE : ALANINE RACEMASE,] [SP : P1072 5 : P9662 0] [DB : swissprot ] >gp : [GI : dl02 0054 : gl8812 74] 
[LN:AB001488] [AC : AB001488] [PN : ALANINE RACEMASE] [GN:alr] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain:168) DNA] [DB :genpept-bctl] [EC : 5 . 1.1.1] [DE:Bacillus 

subtilis genome sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] 
[LE:50435] [RE:51604] [DI:direct] >gp: [GI : el 182430 :g2 632764] [LN : BSUB0003] 
[AC:Z99106 :AL009126] [PN : D-alanine racemase] [GN:dal] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC: 5. 1.1.1] [DE:Bacillus subtilis complete genome (section 3 of 21) 

from 402751 to611850.] [NT : alternate gene name: air] [SP:P10725] [LE:114179] 
[RE : 115348] [DI : direct] 



ORF Name 



NTID 



[AI7503000983 24275017 c2 856 



Description 
NO-HIT 



NT AA 
AAID — . T — , , score 
Length Length 

[4260 jfT52 | |47 [ 



probability 



ORF Name 



NTID 



_ _ -pT^ NT ^ 

AAID — , — score 

LengthLength 



^£7503000983 24552200 ±3 600 



|4261 |P^3"] 



probability 
2.8e-£S 



Description 

pir: [LN:T02833] [AC:T02833 ] [PN : hypothetical protein L4171.5] [OR : Leishmania major] 
[DB:pir2] [MP:1 ] >gp : [GI :g2266911] [LN: AE001274] 

[AC:AE0012 74 : AC003 011 : AC002552 :U6 0409 : AF0082 05 :AC002134 :AF008206 :U7 02 53] [PN:L4171 . 5] 
[GN:L4171.5] [OR : Leishmania major] [DB : genpept - inv2 ] [DE : Leishmania major chromosome 1, 
complete sequence.] [NT: similar to threonine aldolase; leucine zipper] [LE: 124709] 
[RE: 125788] [DI :direct] 



ORF Name 



NTID 



NT AA 
AAID — — . score 
LengthLength 



probability 



AI7503000983_24407758_r3_696 

Description 

NO-HIT 



J [490 | [4252 |[T?5~~||42 | 



187 



ORF Name 



NT AA 
NT ID AAID — . _ — score 
LengthLength - 



probability 



IA17503000983 24412811 c3 99 7 



J [491 [ [ 4263 | pT7— [ pi6 j flTT" || 4 . 4e-ii 



Description 

pir : [LN:T02661] [AC:T02661 ] [PN : lysophospholipase homolog] [GN:LPL1 ] [OR:Oryza 

sativa] [SR:, rice] [DB:pir2] >gp : [GI :g2801536] [LN: AF039531] [AC : AF039531] 

[PN: lysophospholipase homolog] [GNrLPLl] [OR:Oryza sativa] [DB : genpept -pln2 ] [DE:Oryza 

sativa lysophospholipase homolog (LPL1) mRNA, complete cds.] [LE:46] [RE: 960] 

[DI :direct] 



ORF Name 



NT ID 



„ TT ^ NT AA 

AAID — -i _ — , , score 

LengthLength 



probability 



1AS7503000983 24414187 c3 1003 



] I 492 | | 4264 — | — ||0.0ii 



Description 

gp: [GI:g42727] [LN : ECRHAT] [AC:X60699] [GN : sodA] [OR Escherichia coli] 
[DB:genpept-bctl] [DE:E. coli rhaT gene for L-rhamnose permease, rhaC (3 1 end) and 
sodA(5' end) genes.] [SP:P00448] [LE:<1] [RE: 288] [DI : complement] 



ORF Name 



NTID 



_ _ _ _ NT AA 

AAID — — , , score 

LengthLength 



IAI7503000983 24414818 cl 738 



] [493 | [4265 ||87Q-|[289 ||952 | [ 



probability 
|9.8e-96 



Description 

gp: [GI:ell85251:g2634032] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN: nusA] 
[FN: transcription termination] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 9 of 21): from 1598421to 1807200.] [LE:133252] 
[RE : 134367] [DI : direct] 



ORF Name 



NTID 



_ _ __ NT AA 

AAID — — score 

Length Length 



probability 



IA1 7 503000983 24415933 c3 1078 



j [494 | [4266 |pi— [|5i | 



Description 
NO-HIT 



ORF Name 



NTID AAID — —^^ score 
LengthLength 



probability 



[A17503000983 24417512 13 505 



] [495 | [4267 |p^~[| 44 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



IA17503000983 24470927 12 358 



Description 
NO-HIT 



][ 



[4258 



1138 I 



ORF Name 



IAI7503000983 24475252 c3 1052 



Description 
NO-HIT 



NTID AAID — _ „ — i , score 
LengthLength 

|4 97 [ [4269 [JTZS [ [41 | 



probability 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000983_24484683_c3J-041 | |498 | [4270 |p^~[ p" 



Description 
NO-HIT 



188 



ORF Name 



NTID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000983 24492527 ti 223 



] |499 | [4271 ||260r||866 | ppgg"||3 . 9e-108 



Description 

pir: [LN:G69801] [AC:G69801 ] [PN : hypothetical protein yfhO] [GNryfhO ] [OR: Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell82850 :g2633184] [LN : BSUB0005] [AC : Z99108 : AL009126] 
[GN:yfhO] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 5 of 21): from 802821 tol011250.] [LE:128691] [RE:131150] 
[DI:direct] >gp: [GI : dl025397 :g2804545] [LN:D85082] [AC:D85082] [PN:YfhO] [OR:Bacillus 
subtilis] [SRrBacillus subtilis DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA, 
genome sequence, 79 to 81 degree region.] [LE:21582] [RE:24041] [DI:direct] 



ORF Name 



UI7503000983 24435928 cl 772 



NTID 



AAID 



TTTT 



NT AA 
LengthLength 



59" 



score probability 

rro — 110.028 



Description 

pir: [LN:G69058] [AC:G69058 ] [PN : hypothetical protein MTH1440] [GN:MTH1440 ] 
[OR:Methanobacterium thermoautotrophicum] [DB:pir2] >gp : [GI : g2622552] [LN: AE000905] 
[AC:AE000905:AE000666] [PN:unknown] [GN:MTH1440] [OR : Methanobacterium 
thermoautotrophicum] [DB :genpept-bct2] [DE : Methanobacterium thermoautotrophicum from 
bases 1293944 to 1305586 (section 111 of 148) of the complete genome.] [NT:Function 
Code:14.00 - Unknown, ; similar to,] [LE:7011] [RE:7214] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



IA17503000983 2453557 cl 755 



Description 
NO-HIT 



] |50i | [4273 |[ITr-]|57 | 



ORF Name 



NTID 



„ 7VTr , NT AA 

AAID — . T — , , score 

LengthLength 



|AlVb03000983 24531537 c3 1002 



4274 I [TITS' 



TIT 



probability 
7.3e-20 



Description 

gp: [GI:g2444107] [LN:U88974] [AC:U88974] [PN:ORF28] [OR : Streptococcus thermophilus 
temperate bacteriophage O1205] [DB : genpept -phg] [DE : Streptococcus thermophilus 
temperate bacteriophage O1205, completegenome . ] [LE: 17062] [RE: 17955] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ „ — u score 
LengthLength 



probability 



IAI7503000983 24540325 c3 1037 



] F 03 | | 4275 |p7T]|423 |pg8ip.4e-i3i 



Description 

sp: [LN:OPUD_BACSU] [AC:P54417] [GN:OPUD] [OR:BACILLUS SUBTILIS] [DE : GLYCINE BETAINE 
TRANSPORTER OPUD] [SP:P54417] [DB : swissprot] >pir : [LN: G69670] [AC:G69670 ] [PN:glycine 
betaine transporter opuD] [GN:opuD ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85880:g2635491] [LN : BSUB0016] [AC : Z99119 : AL009126] [PN:glycine betaine 
transporter] [GN:opuD] [FN: osmoprotection] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 16 of 21): from 2997771to 3213410.] 
[NT alternate gene name: ytfQ] [SP:P54417] [LE: 78128] [RE: 79666] [DI: direct] 
>gp: [GI:g2293330] [LN: AF008220] [AC : AF00822 0] [PN:putative transporter] [GN:opuD] 
[OR:Bacillus subtilis] [DB : genpept -bct2] [DE:Bacillus subtilis rrnB-dnaB genomic 
region.] [LE:100760] [RE:102298] [DI : complement] >gp : [GI : gl524397] [LN:BSU50082] 
[AC:U50082] [PN:glycine betaine transporter OpuD] [GN:opuD] [OR:Bacillus subtilis] 
[DB: genpept -bet 2] [DE:Bacillus subtilis glycine betaine transporter OpuD (opuD) 
gene, complete cds . ] [LE:460] [RE:1998] [DI:direct] 



189 



ORF Name 



NT 



AA 



™ AAI5 Le ^thLe^th ^^ probability 



LL7503000983 24647176 c2 928 



Description 

gp: [GI:g4981173] [LN: AE001738] [AC : AE001738 : AE000512] [PN : conserved hypothetical 
protein] [GN:TM0651] [OR : Thermo toga maritima] [DB :genpept-bct2] [DE : Thermo toga maritima 
section 50 of 136 of the complete genome.] [NT:similar to PID:1653547 percent identity: 
52.43;] [LE: 13445] [RE: 14251] [DI : complement] 



ORF Name 



NTID 



AAID . 



E 



17503000983 24647182 cl 821 



TTfT 



NT AA 
LengthLength 
TZ5 



score probability 
1232 ||3.9e-33 



Description 

gp: [GI:g4982086] [LN : AE001799] [AC : AE001799 : AE000512] [PN: dihydrodipicolinate 
reductase] [GN:TM1520] [OR : Thermotoga maritima] [DB : genpept-bct2] [DE : Thermotoga 
maritima section 111 of 136 of the complete genome.] [NT: similar to PID: 1185002 percent 
identity: 62.67;] [LE: 16971] [RE:17621] [DI : complement] 



ORF Name 



NTID AAID — , — , score 
Length Length 



probability 



|AI7S03000983_245B9382_c2_890" 

Description 

NO-HIT 



"] [505 I |4278 ||T25 I |3T 



NT AA 

ORF Name NTID AAID — — ^ score probability 
Length Length L - 



A17i>03000983_24666043_cl_803 | |507 | [42 7 9 | pTT] [ 410 j | 866 || 1.3e-86 



Description 

pir: [LN:D70006] [AC:D70006 ] [PN : conserved hypothetical protein yubA] [GN:yubA ] 
[CL: Bacillus subtilis conserved hypothetical protein yueF] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : ell85989 :g2635600] [LN:BSUB0016] [AC : Z99119 : AL009126] [GN : yubA] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 16 of 21): from 2997771to 3213410.] [NT:similar to hypothetical 

proteins] [LE: 196858] [RE: 198024] [DI : complement] 

NT AA 

ORF Name NTID AAID t — _ t — ^ score probability 
LengthLength L - 



|AI7S030003fl3_24744010_c3_1028 | [508 | [ 4280 |p73~| |9Q |j 347 || 1.3e-31 

Description 

pir : [LN:F69835] [AC:F69835 ] [PN : ribosomal protein S14 homolog yhzA] [GN : yhzA ] 
[CL: Escherichia coli ribosomal protein S14] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell82877:g2633211] [LN: BSUB0005] [AC : Z99108 : AL009126] [GN: yhzA] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
5 of 21): from 802821 tol011250.] [NT:similar to ribosomal protein S14] [LE:162598] 
[RE: 162867] [DI: direct] 



190 



NT AA 

ORF Name NT ID AAID Le ^ t frLength score probability 



A1750300098JJ^4792V7l>_c2Jj46 | | 509 | [4281 || 528 1 1175 |p 7 1 || 3.6e- TT 



Description 

sp: [LN:YLXS_BACSU] [AC:P32726] [GN : YLXS] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 17.6 
KD PROTEIN IN NUSA 5' REGION (P15A) (ORF1) ] [SP:P32726] [DB : swissprot] >pir : [LN:B36905] 

[AC:B36905 :E69882 :S31990 ] [PN : conserved hypothetical protein ylxS] [GN:ylxS ] [CL:nus 
operon 15K protein] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g49315] [LN: BS0RF1T7A] 

[AC:Z18631] [GN:0RF1] [ORiBacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis infB-nusA 
operon.] [SP:P32726] [LE:456] [RE:926] [DI:direct] >gp : [GI : ell85250 : g2634031] 

[LN:BSUB0009] [AC : Z99112 : AL009126] [GN:ylxS] [FN:unknown] [ORrBacillus subtilis] 

[DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 9 of 21) : from 
1598421to 1807200.] [NT : alternate gene name: ymxA; similar to hypothetical] [SP:P32726] 

[LE: 132 747] [RE: 133217] [DI : direct] 



NT AA 

ORF Name NT ID AAID — .„ — _ score probability 
LengthLength ^ 



AI7503000983_24823377_r3_498 [ |5i0 | [4282 | 303 | p2~ || 8 . 2e-44 



Description 

pir: [LN:E69840] [AC:E69840 ] [PN : hypothetical protein yitL] [GN:yitL ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : el 183105 :g2 63 34 39] [LN: BSUB0006] [AC : 299109 :AL009126] 
[GN:yitL] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 6 of 21): from 999501 tol209940.] [LE:181473] [RE:182369] 
[DI: direct] 

NT AA 

ORF Name NTID AAID — . — , , score probabi lity 
LengthLength 



AI7503000983_24877312J:i_10S | |511 | [4283 ||1392 1 |453 ||1093 ||1 . le-110 



Description 

pir: [LN:A69763] [AC:A69763 ] [PN : homoserine dehydrogenase homolog yclM] [GN:yclM ] 
[CL: aspartate kinase homology] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI :ell82346 :g2632680] [LN: BSUB0003] [AC : Z99106 : AL009126] [GN:yclM] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
3 of 21): from 402751 to611850.] [NT:similar to homoserine dehydrogenase] [LE:27453] 
[RE:28817] [DI : complement ] >gp : [GI : dl009646 :gl805449] [LN:D50453] [AC:D50453] 
[PN:homologue of aspartokinase 2 alpha and beta] [GN:yclM] [OR: Bacillus subtilis] 
[SR: Bacillus subtilis (strain: 168 trpC2) DNA] [DB : genpept-bctl] [DE: Bacillus subtilis 
DNA for 25-36 degree region containing theamyE-srfA region, complete cds . ] [LE: 109856] 
[RE: 111220] [DI : complement ] 



NT AA 

ORF Name NTID AAID r — — , , score probabi lity 

LengtnLengtn 

|AI7503000983_24886550_clJ741 | |512 | [4284 | | 28ff ~] | 95 | pbO || 6.ie-32 



Description 

pir : [LN:F69700] [AC : F69700 : S11365 : S70690 ] [PN : ribosomal protein S15 (rpsO) :ribosomal 
protein BS18] [GN:rpsO ] [CL : Escherichia coli ribosomal protein S15 : eubacterial 
ribosomal protein S15 homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:e269878:gl592691] [LN : BSRIBRPS] [AC:Z80835] [PN : ribosomal protein S15] 
[GN:rpsO] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE :B . subtilis ribC, rpsO and pnpA 
genes.] [SP:P21473] [LE:1195] [RE:1464] [DIrdirect] >gp : [GI : ell85259 : g2634040] 
[LN:BSUB0009] [AC : Z99112 : AL009126] [PN : ribosomal protein S15 (BS18)] [GN:rpsO] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [SP:P21473] [LE:139912] [RE:140181] [DI:direct] 



191 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 

|AI7B030009 fl3JMBB6fc7 V_cJ_97i | [ 513 [ [573 1 p^F] [i . Je-194 

Description 

pir: [LN:G69682] [AC:G69682 ] [PN : proline- - tRNA ligase, proS : prolyl -tRNA synthetase] 
[GNiproS ] [CL: proline --tRNA ligase] [OR:Bacillus subtilis] [EC : 6 . 1 . 1 . 15] [DB:pir2] 
>gp: [GI:ell85248:g2634029] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN : prolyl -tRNA 
synthetase] [GNiproS] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [LE:126301] [RE:127995] 
[DI: direct] 



NT AA 

QRF Name NT ID AAID — . T — , score probability 
Length Length c L - 



&I7B03000983_24S00332_c2_fl70 | |5i4 | [ 4286 |p^~| |32g || 935 || 6.2e- "gT 



Description 

sp: [LN:MUTL_BACSU] [AC:P49850] [GNrMUTL] [OR : BACILLUS SUBTILIS] [DE:DNA MISMATCH REPAIR 
PROTEIN MUTL] [SP:P49850] [DB : swissprot] >pir : [LN: A69663] [AC:A69663 ] [PN:DNA mismatch 
repair mutL] [GN:mutL ] [CL: mismatch repair protein hexB] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI:gl002521] [LN : BSU27343] [AC:U27343] [PN :MutL] [GN:mutL] 
[FN:mismatch-repair recognition] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis spore coat protein (cotE) gene, partial cds, andmismatch repair recognition 
proteins (mutS) and (mutL) genes , complete cds.] [LE:2782] [RE: 4665] [DI:direct] 
>gp: [GI:ell85296:g2634077] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:mutL] [ FN : DNA 
mismatch repair] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [SP:P49850] [LE:179285] 
[RE:181168] [DI :direct] 

NT AA 

QRF Name NT ID AAID — — , score probability 
Length Length c J - 



|A17b03000983_2S44S2&3_c3_1039 | | [4287 | |495 ~| |lg4 | p5~ | [1 . 8e-30 | 

Description ~~ 
pir: [LN:F69891] [AC:F69891 ] [PN : conserved hypothetical protein yneP] [GN:yneP ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e249652 :gl405456] [LN : BC170DEGR] [AC:Z73234] 
[PN:YneP] [GN:yneP] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis DNA (26.2 
kb fragment; 170 degree region).] [LE:12510] [RE:12875] [DI:direct] 

>gp: [GI : el 183462 :g2 634187] [LN : BSUB0010] [AC : Z99113 : AL009126 ] [GN:yneP] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
10 of 21): from 1781201to 2014980.] [NT:similar to hypothetical proteins] [LE:148889] 
[RE : 149254] [DI : direct] 



NT AA 

QRF Name NTID AAID — , — , score probability 

LengthLength £ - 



AI75030009S3_55449051_clJ780 | [515 | [4288 | p^~| | 82 | | 191 | |4.3e-15 



Description 

pir: [LN:D69901] [AC:D69901 ] [PN : two- component response regulator [YocF] homolog yocG] 
[GN : yocG ] [CL : regulatory protein comA: response regulator homology] [OR: Bacillus 

subtilis] [DB:pir2] >gp : [GI :g2619014] [LN : AF027868] [AC : AF02 7868] [PN:sensor regulator] 
[GN :yocG] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis chromosome 

region between terC and odhAB.] [NT: similar to B. subtilis SpoOA protein (267 aa) ] 
[LE:71699] [RE:72298] [DI:direct] >gp : [GI : ell85392 :g2634313] [LN : BSUB0011] 
[AC:Z99114:AL009126] [GN:yocG] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 11 of 21): from 2000171to 2207900.] 
[NT:similar to two - component response regulator [YocF]] [LE:90790] [RE:91389] 
[DI : direct] 



192 



ORF Name 



NTID 



NT AA 
AAID — , _ — sco re 
LengthLength 



probability 



IA1 7 503000983 25578140 t3 612 



|517 | [4289 ||132~||43 | 



Description 
NO-HIT 



ORF Name 



NTID 



_ _ _ NT AA 

AAID — , — .score 

LengthLength 



probability 



p7503000983_25587942_c3_9g(5 



][ 



[4290 |[Tgg — | pgl ||S55 ||9.0e-54 



Description 

pir : [LN:C69693] [AC:C69693 ] [PN : ribonuclease H rnh] [GNrrnh ] [CL : ribonuclease HII] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85197 :g2633978] [LN : BSUB0009] 
[AC:Z99112 :AL009126] [PN : ribonuclease H] [GN:rnh] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC : 3 . 1 . 26 . 4] [DE:Bacillus subtilis complete genome (section 9 of 
21): from 1598421to 1807200.] [LE: 78430] [RE:79197] [DI: direct] 



ORF Name 



E 



17503000983 25593925 tl 166 



Description 
NO-HIT 



NTID 



][ 



„ TO NT AA 

AAID — — score 

LengthLength 



probability 



|4291 |f!53 | 



ORF Name 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



IAI7503000983 25626625 c2 891 



][ 



|4292 j |211 | 



Description 
NO-HIT 



ORF Name 



NTID AAID — , , score 
LengthLength 



probability 



17503000983 25664512 c2 915 



|521 | [4293 ||495~||164 | p7^~||2 . Oe-24 



Description 

pir: [LN:A69805] [AC:A69805 ] [PN : hypothetical protein yfiW] [GN:yfiW ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell82832 :g2633166] [LN : BSUB0005] [AC : Z99108 : AL009126] 
[GN:yfiW] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 5 of 21) : from 802821 tol011250.] [LE: 113464] [RE: 114240] 
[DI:direct] >gp: [GI : dl025379 :g2804527] [LN:D85082] [AC:D85082] [PNrYfiW] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA, 
genome sequence, 79 to 81 degree region.] [LE:6355] [RE: 7131] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID — — v 
LengthLength 



score 



LL7503000983 25665937 c3 968 



J |522 | [4294 |[^~| |185 |[^~| [ 



probability 
5.7e-84 



Description 

sp : [LN : RRF_STAAU] [AC:033276] [GN: FRR] [OR : STAPHYLOCOCCUS AUREUS] [DE : PROBABLE RIBOSOME 
RECYCLING FACTOR (RIBOSOME RELEASING FACTOR) (RRF) ] [SP:033276] [DB : swissprot] 
>gp: [GI:g2645713] [LN: AF033018] [AC : AF033018] [PN:ribosome recycling factor] [GN:frr] 
[OR staphylococcus aureus] [DB : genpept-bct2] [DE : Staphylococcus aureus ribosome 
recycling factor (frr) gene , complete cds.] [LE:1] [RE:555] [DI:direct] 



193 



NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 

| A1 7 5030009aa_ai»VBJ7_cJ_10ia | | 523 | [4295 || 1 7 4 | |b7 |p || O.U04b -| 

Description 

gp: [GI:g2668605] [LN : AF015453] [AC : AF015453] [PN:unknown] [OR : Lactobacillus rhamnosus] 
[DB :genpept-bct2] [DE : Lactobacillus rhamnosus 6-phospho-beta-glucosidase homolog 
gene, partial cds; GNTR transcriptional regulator homolog and surf acelocated protein 
genes, complete cds.] [NT : 3 . OE-ORF-1] [LE:2236] [RE:>3603] [DI : complement] 

NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 



|&I7S030009fl3_2B3773ia_c3_1044 | |524 | [ 4296 |p?TF] | 804 |pT7TT| [0.0 

Description 

Sp : [LN : PARC_STAAU] [AC : P50073 : P95682 : P95683 ] [GN : PARC : GRLA] [OR : STAPHYLOCOCCUS AUREUS] 
[EC:5.99.1. -] [DE : TOPOISOMERASE IV SUBUNIT A, ] [SP : P50073 : P95682 : P95683] [DB : swissprot] 
>gp: [GI:dl011747:gl777321] [LN:D67075] [AC:D67075] [PN:DNA topoisomerase IV GrlA 
subunit] [GN: grlA] [OR : Staphylococcus aureus] [SR : Staphylococcus aureus 
(isolate :RN4220) DNA] [DB :genpept-bctl] [DE : Staphylococcus aureus DNA for DNA 
topoisomerase IV GrlB subunit, DNA topoisomerase IV GrlA subunit, complete cds.] 
[LE:2376] [RE:4778] [DI: direct] 



NT AA 

ORF Name NTID AAID — — , , score probability 

LengthLength c 

AI7503000983_26205387_r2_331 | |525 | [ 4297 |p35~| [ 111 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength - 

AI7g03000983_2621092b_c3_1032 | |525 | [ 4298 |pU7^| 675 | [2097 ||4 . 5e-217 



Description 

sp: [LN:TKT_BACSU] [AC:P45694] [ GN : TKT : TKTA] [OR:BACILLUS SUBTILIS] [EC:2.2.1.1] 
[DE : TRANS KETOLASE , ] [SP:P45694] [DB : swissprot ] >pir : [LN : G69723 ] [AC : G69723 : S57401 ] 
[PN: transketolase, tkt] [GN:tkt ] [CL : transketolase : thiamine pyrophosphate -binding 
domain homology] [OR: Bacillus subtilis] [EC: 2. 2. 1.1] [DB:pir2] 

>gp: [GI:e249642:gl405446] [LN : BC170DEGR] [AC:Z73234] [PN : transketolase] [GN:tktA] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis DNA (26.2 kb fragment; 170 

degree region).] [SP:P45694] [LE:1537] [RE:3540] [DI:direct] >gp : [GI : ell83448 :g2634173] 
[LN:BSUB0010] [AC : Z99113 : AL009126 ] [PN : transketolase] [GN:tkt] [FN:pentose phosphate] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [EC:2.2.1.1] [DE:Bacillus subtilis complete 

genome (section 10 of 21): from 1781201to 2014980.] [NT : alternate gene name: tktA] 
[SP:P45694] [LE:137916] [RE:139919] [DI: direct] 



NT AA 

ORF Name NTID AAID — _ T — , score probab ility 
LengthLength c i 



AI7503000983_262i3890_c3_i02g | |527 | [ 4299 || g99 ~| | 232 || 757 || 4.6e-7B 



Description 

gp: [GI:e313391:g2052219] [LN : SCDNACAT] [AC:X96981] [PN : catalase] [GN : catA] 
[OR:Streptomyces coelicolor] [DB : genpept-bctl] [DE : S . coelicolor catA gene.] [LE:392] 
[RE: 1858] [DI:direct] 



194 



NT AA 

0RF Name J™ ™I5 Le — thLe — th score probability 



AI 7 503000983J^2S780^c3J^3 [ | 528 | [ 4300 [ p^T"] [327 | [7P7-"| p . Oe-70 



Description 

sp: [LN : MUTL_BACSU] [AC:P49850] [GNrMUTL] [OR : BACILLUS SUBTILIS] [DE:DNA MISMATCH REPAIR 
PROTEIN MUTL] [SP:P49850] [DB : swissprot] >pir : [LN: A69663] [AC:A69663 ] [PN:DNA mismatch 
repair mutL] [GN:mutL ] [CL: mismatch repair protein hexB] [OR: Bacillus subtilis] 
[DB:pir2] >gp : [GI :gl002521] [LN : BSU27343] [AC:U27343] [PNrMutL] [GN:mutL] 
[FN: mismatch- repair recognition] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus 
subtilis spore coat protein (cotE) gene, partial cds, andmismatch repair recognition 
proteins (mutS) and (mutL) genes , complete cds . ] [LE:2782] [RE:4665] [DIrdirect] 
>gp: [GI:ell85296:g2634077] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN : mutL] [ FN : DNA 
mismatch repair] [OR: Bacillus subtilis] [DB : genpep t - be t 1 ] [DE: Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [SP:P49850] [LE:179285] 
[RE:181168] [DI :direct] 



NT AA 

ORF Name NT ID AAID — _ „ — , , score prob ability 
Length Length c 



A17503000383_26306257_c2_835 | |523 | [4301 | pU73"| | 590 | [2490 | |i . Oe-258 



Description 

sp: [LN:T0P1_BACSU] [AC:P39814] [GN : TOPA : TOPI] [OR:BACILLUS SUBTILIS] [EC : 5 . 99 . 1 . 2] 
[DE: (UNTWISTING ENZYME) (SWIVELASE) ] [SP:P39814] [DB : swissprot] >pir : [LN : G69724] 
[AC:G6 9724 ] [ PN : DNA topoisomerase I topA] [GN : topA ] [CL : DNA topoisomerase I] 
[OR: Bacillus subtilis] [DB:pir2] >gp : [GI : g520753] [LN : BACSMF] [AC:L27797] [ PN : DNA 
topoisomerase I] [FN: DNA unwinding protein: removes negative] [OR: Bacillus subtilis] 
[SR:Bacillus subtilis (strain 8G5) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis (smf) 
gene, 3' end, DNA topisomase gene, completecds, (gid) gene, 5 ! end.] [LE:673] [RE:2748] 
[DI:direct] >gp : [GI :ell85203:g2 633984] [LN :BSUB0009] [AC : Z99112 : AL009126] [PN: DNA 
topoisomerase I] [GN : topA] [ FN : DNA unwinding protein removing negative] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [EC : 5 . 99 . 1 . 2] [DE:Bacillus subtilis complete genome 
(section 9 of 21): from 1598421to 1807200.] [NT : alternate gene name: topi] [SP:P39814] 
[LE:84640] [RE:86715] [DI: direct] >gp : [GI : e332180 : g2462 970] [LN : BSYLQGCOD] 
[AC:AJ000975] [ PN : DNA Topoisomerase I] [GN: topA] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis ylqg to codV gene region.] [SP:P39814] 
[LE:3808] [RE:5883] [DI:direct] 



NT AA 

ORF Name NT ID AAID — _ T — , , score prob ability 
Length Length 



AI7503000983_2g353417_ciJ753 | | b30 | [ 4302 [97 | [ 141 || 8.5e-10 



Description 

sp: [LN:YVI2_CLOPE] [AC:Q46213] [OR : CLOSTRIDIUM PERFRINGENS] [DE : HYPOTHETICAL 10.7 KD 
PROTEIN IN VIRR 5' REGION (ORF2) ] [SP:Q46213] [DB : swissprot] >pir : [LN: S49553] [AC:S49553 
] [PN: hypothetical protein 2] [OR : Clostridium perfringens] [DB:pir2] >gp : [GI :g498839] 
[LN:CPVIRRS] [AC:U04966] [FN:unknown] [OR : Clostridium perfringens] [DB : genpept-bctl] 
[DE: Clostridium perfringens JIR4025 extracellular toxin productionregulatory locus ORF1 
and ORF3 genes, partial cds, and ORF2,ORF10c, virR, virS, and ORF4 genes, complete 
Cds.] [NT : ORF2] [LE:469] [RE: 756] [DI: direct] 



195 



ORF Name 



NTID 



NT AA 
AAID — _ — ,,,, score 
Length Length 



probability 



U7S030009B3 2635483 7 c2 945 



] | 531 1 | 4303 |pnT"||299 ||57Q— j|2.9e-55 



Description 

Sp : [LN : DAPA_METJA] [AC:Q57695] [GN : DAPA : M JO 2 44] [OR : METHANOCOCCUS JANNASCHII] 
[EC:4.2.1.52] [DE : DIHYDRODIPICOLINATE SYNTHASE , (DHDPS) ] [SP:Q57695] [DB : swissprot] 
>pir: [LN:E64330] [AC:E64330 ] [PN : dihydrodipicolinate synthase,] [OR : Methanococcus 
jannaschii] [EC : 4 . 2 . 1 . 52] [DB:pir2] [MP : REV2 33451-232582 ] >gp : [GI : gl590977] 
[LN:U67480] [AC : U6 7480 :L7 7117] [PN : dihydrodipicolinate synthase (dapA) ] [GN:MJ0244] 
[OR: Methanococcus jannaschii] [DB : genpept-bct2] [DE : Methanococcus jannaschii section 22 
of 150 of the complete genome.] [NT:similar to GB:L08471 SP:Q04796 PID:142830] [LE:511] 
[RE : 1380] [DI : complement] 



ORF Name 



NT 

NTID AAID 



AA 



LengthLength 



score probability 



A17503000983_26369016_cl_773 | |532 | [4304 |p7~| |128 | |Tu3~[ |5 . 6e-06 



Description 

gp: [GI:el39437:gl369938] [LN:BTP9011] [AC:X84706] [PN : scaffolding protein] [GN :bl 
(sfp)] [OR: Bacteriophage Bl] [DB :genpept-phg] [DE : Bacteriophage TP901-1 genomic 
region.] [NT : putative] [LE:<1] [RE:358] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID . , — 



LengthLength 



score 



IAI7503000983 26460951 c2 539 



probability 
] |533 | [4305 |[73T-j |263 |[917 ||b.0e-92 



Description 

pir: [LN:F69708] [AC:F69708 ] [PN :uridylate kinase smbA] [GN: smbA ] [CL:uridine 
5 * -monophosphate kinase] [OR:Bacillus subtilis] [DB:pir2] >gp: [GI :ell85242 :g2634023] 
[LN:BSUB0009] [AC : Z99112 : AL009126] [PN : uridylate kinase] [GN: smbA] [FN:pyrimidine 
biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:2.7.4.-] [DE:Bacillus 
subtilis complete genome (section 9 of 21): from 1598421to 1807200.] [LE:120774] 
[RE: 121496] [DI rdirect] 



ORF Name 



NT AA 
NTID AAID — _ T — , , score 
LengthLength 



probability 



IAI7503000983 26569432 F2 444 



] |534 | [4306 



55" 



Description 
NO-HIT 



ORF Name 



«. _ _ ^TT .^Vi^L 

NTID AAID — ._ — - score 

LengthLength 



probability 



117503000983 26586537 t2 376 



] [535 | [4307 |p3F~|[51 |[70 ||0.028 



Description 

gp: [GI:dl045213 :g5106116] [LN: AP000064] [AC : AP000064] [PN:125aa long hypothetical 
protein] [GN:APE2412] [OR : Aeropyrum pernix] [SR : Aeropyrum pernix ( strain :K1) DNA] 

[DB:genpept] [DE : Aeropyrum pernix genomic DNA, section 7/7.] [LE: 105148] [RE: 105525] 

[DI : direct] 



ORF Name 



AI75030009B3_2734778_iI_3i 

Description 

NO-HIT 



NTID 



„ TO NT AA 

AAID — — score 

LengthLength 

|4308 [[125 | |41 | 



probability 



196 



NT AA 

ORF Name NTID AAID Le ^ t h Length score probability 



Al7b03000983ja7b427_cij;44 | [537 | [4309 [pTT^T] |b73 | [1379 |p~5e- 141 



Description 

pir: [LN:H69884] [AC:H69884 ] [PN : conserved hypothetical protein ymfA] [GN:ymfA ] 
[CL: conserved hypothetical protein MG139] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85269:g2634050] [LN: BSUB0009] [AC : Z99112 : AL009126] [GNrymfA] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE : Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NTrsimilar to hypothetical proteins] [LE:150509] 
[RE : 152056] [DI : direct] 



NT AA 

ORF Name NTID AAID Le ^ thLe ~ th score probability 



AI7503000983_233i337_i2_25g | [538 | [ 4310 |[7gP — | [253 || 77i | 



i.5e-7S 



Description 

gp: [GI:g3800827] [LN : AF076684] [AC : AF076684] [PN : oligopeptide transporter putative 
ATPase domain] [GN:opp-2D] [OR : Staphylococcus aureus] [DB : genpept-bct2] 
[DE : Staphylococcus aureus oligopeptide transporter putative membranepermease domain 
(opp-2B) , oligopeptide transporter putativemembrane permease domain (opp-2C) , 
oligopeptide transporterputative ATPase domain (opp-2D) , and oligopeptide 
transporterputative ATPase domain (opp-2F) genes, complete cds . ] [LE:1966] [RE: 2742] 
[Dlidirect] 



NT AA 

ORF Name NTID AAID — _ — . score probability 
LengthLength 

AI9S030005fl3_2978i&68_c3_996 | |539 [ |43ii |p5~ 1 1 284 || 825 | |2.8e-82 



Description 

sp : [LN : GLPF_BACSU] [AC:P18156] [GN : GLPF] [OR:BACILLUS SUBTILIS] [DE: GLYCEROL UPTAKE 
FACILITATOR PROTEIN] [SP:P18156] [DB : swissprot] >pir : [LN : C47700] 

[AC:C47700 :A45868 :B69634 :S18563 ] [PN:glycerol uptake facilitator glpF] [GN : glpF ] 
[CL:glycerol facilitator protein] [OR:Bacillus subtilis] [DB:pir2] [MP:75 (degrees) ] 
>gp: [GI:gl42997] [LN: BACGLPPFK] [AC:M99611] [PN:glycerol uptake facilitator] [GN :glpF] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis DNA] [DB :genpept-bctl] [DE:Bacillus 
subtilis antiterminator regulatory protein (glpP) , glycerol uptake facilitator (glpF) 
genes, complete cds, glycerolkinase (glpK) gene, 5' end J [NT : putative] [LE:1085] 
[RE:1909] [DI:direct] >gp: [GI : ell82917 :g2633251] [LN: BSUB0005] [AC : Z99108 : AL009126] 
[PN:glycerol uptake facilitator] [GN:glpF] [FN:glycerol utilization] [OR:Bacillus 
subtilis] [DB : genpept-bct 1] [DE:Bacillus subtilis complete genome (section 5 of 21) : 
from 802821 tol011250.] [SP:P18156] [LE:199186] [RE:200010] [DI:direct] 
>gp: [GI:ell82929:g2633263] [LN: BSUB0006] [AC : Z99109 : AL009126] [PN:glycerol uptake 
facilitator] [GN: glpF] [FN:glycerol utilization] [OR:Bacillus subtilis] 
[DB : genpept-bct 1] [DE:Bacillus subtilis complete genome (section 6 of 21) : from 999501 
tOl209940.] [SP:P18156] [LE:2506] [RE:3330] [DI:direct] >gp : [GI : e324940 : g2226136] 
[LN:BSY14079] [AC:Y14079] [PN:putative glycerol uptake facilitator] [GN : glpF] 
[OR: Bacillus subtilis] [DB : genpept-bct 1] [DE: Bacillus subtilis chromosomal DNA, region 
75 degrees: glpPFKDoperon and downstream.] [NT: see EMBL M99611 and Swiss Prot P18156.] 
[SP:P18156] [LE:2154] [RE:2978] [DI:direct] 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength 



AI7503000583^2588g0ii_t2_248 | |540 | [4312 ||i35 1 144 | 



Description 
NO-HIT 



197 



ORF Name 



NT AA 
NT ID AAID „ — — ^, score 
Length Length 



probability 



IA17503000983 303bi>313 el 779 



j |541 | [4313 ||iili|p70 ||382 p.6e-35 



Description 

pir: [LN:C69901] [AC:C69901 ] [PNrprobable two -component sensor histidine kinase yocF] 
[GN:yocF ] [CL:probable Bacillus subtilis two-component sensor histidine kinase yocF] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI : g2619013] [LN: AF027868] [AC : AF027868] 
[PNrsensor kinase] [GN : yocF] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus 

subtilis chromosome region between terC and odhAB.] [NT: similar to B. subtilis DegS 

sensor kinase (385 aa) ] [LE:70568] [RE:71680] [DIrdirect] >gp : [GI : ell853 91 :g2634312] 
[LN:BSUB0011] [AC : Z99114 : AL009126] [GN : yocF] [FNiunknown] [ORrBacillus subtilis] 
[DB:genpept-bctl] [DE : Bacillus subtilis complete genome (section 11 of 21) : from 

2000171to 2207900.] [NT: similar to two-component sensor histidine kinase] [LE: 89659] 
[RE : 90771] [DI : direct] 



ORF Name 



NT ID AAID 



NT 



AA 



LengthLength 



E 



17503000983 30656317 c2 838 



][ 



[4314 ||822 1 1273 | 



score probability 
858 ||8.9e-8S 



J 



Description 

pir: [LN:S61496] [AC : S61496 : H696 01 ] [PN : transcription pleiotropic repressor codY] 
[GN : codY ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : g535351] [LN: BSU13634] [AC:U13634] 
[PN:CodY] [GN: codY] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
JH642 dipeptide permease operon regulators, codV,codW, codX, and codY genes, complete 
cds.] [LE:3225] [RE:4004] [DIrdirect] >gp : [GI : ell85208 : g2633989] [LN: BSUB0009] 
[AC:Z99112 :AL009126] [PN : transcriptional regulator] [GN : codY] [FN:negative regulation 
of srfA and comK genes (in] [OR:Bacillus subtilis] [DB : genpept - be t 1 ] [DE:Bacillus 
subtilis complete genome (section 9 of 21): from 1598421to 1807200.] [SP:P39779] 
[LE:91098] [RE:91877] [DIrdirect] 



ORF Name 



NT AA 
NT ID AAID — _ T — , , sco re 
LengthLength 



probability 



|Al7b03000983_31275_c2_927 



Description 
NO-HIT 



ORF Name 



NT ID AAID 



NT 



AA 



U7S03000983 31552 ci 755 



[4316 



LengthLength 
|52"2" 



T7T 



score probability 
197 ||9.9e-i6 



Description 

gp: [GI:dl045212:g5106115] [LN : AP000064 ] [AC : AP000064] [PN:197aa long hypothetical 
protein] [GN:APE2411] [OR: Aeropyrum pernix] [SR : Aeropyrum pernix (strain:Kl) DNA] 
[DB:genpept] [DE : Aeropyrum pernix genomic DNA, section 7/7.] [NT :motif =G-protein 
coupled receptors signature] [LE: 104943] [RE: 105536] [DI : complement] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000983 31697151 c2 826 



|545 | [4317 |pT5~[|104 ||441 | 



1.4e-41 



Description 

sp: [LN:RL19_BACSU] [AC:031742] [GN : RPLS] [OR:BACILLUS SUBTILIS] [DE:50S RIBOSOMAL 
PROTEIN L19] [SP:031742] [DB : swissprot] >pir: [LN:E69696] [AC:E69696 ] [PN : ribosomal 
protein L19 rplS] [GN:rplS ] [CL : Escherichia coli ribosomal protein L19] [OR: Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell85195:g2 633976] [LN : BSUB0009] [AC : Z99112 : AL009126] 
[PN: ribosomal protein L19] [GN:rplS] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[SP:031742] [LE:77012] [RE:77368] [DIrdirect] 



198 



ORF Name 



NT ID 



AAID 



NT 



AA 



Length Length 



score probability 



117503000983 31803760 FT 9 



J |546 | pi8 |pi~||bO |p 1|0.031 



Description 

gp: [GI:el347527:g3877915] [LN:CEF58G1] [AC:Z81556] [GN:F58G1.9] [OR : Caenorhabditis 
elegans] [DB : genpept - invl ] [DE : Caenorhabditis elegans cosmid F58G1, complete sequence.] 

[NTrpredicted using Genefinder] [LE : 13299 : 13426 : 13549] [RE : 13379 : 13500 : 13656] 

[DI : direct Join] 



ORF Name 



[Ar7503u00983_3i844658_c3_987 



Description 
NO-HIT 

ORF Name 



IA17503000983 3235828 cl 795 



Description 
NO-HIT 



NT ID 



_ _ NT AA 

AAID — « _ — , . score 

LengthLength 



probability 



][ 



NTID 



|43i9 [|I38 | 



11 



^ tt ^ NT AA 

AAID — , — score 

LengthLength 



probability 



|4320 |p75 | p" 



ORF Name 



IAI7503000983 3257827 c2 858 



NTID 



NT 



AA 



AAID , — , — 



LengthLength 



score 



] |549 | [4321 ||234 | [Im- 



probability 
i.4e-32 



Description 

pir: [LN:H69885] [AC:H69885 ] [PN: 3-oxoacyl- acyl-carrier protein reductase homolog 

ymfl] [GN:ymfI ] [CL : short -chain alcohol dehydrogenase homology] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : ell85278 :g2634059] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:ymfI] 
[FNrunknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete 

genome (section 9 of 21): from 1598421to 1807200.] [NT:similar to 3-oxoacyl- 

acyl-carrier protein] [LE: 160607] [RE: 161335] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



E 



17503000983 32611557 c3 966 



Description 

sp : [LN : EFTS_BACSU] 

FACTOR TS (EF-TS) ] 



][ 



F5u~ 



[4322 



13TB" 



probability 
2.4e-28 



[AC:P80700:O31748] 
[SP:P80700 :031748] 



[GN:TSF] [OR: BACILLUS SUBTILIS] 
[DB: swissprot] 



[DE : ELONGATION 



ORF Name 



NTID 



IAI7503000983 33283167 c3 983 



][ 



F5T" 



NT AA 

AAID 

LengthLength 

[4323 |P^5~| p^ 



score probability 
967 ||2.5e-97 



Description 

pir: [LN:G69885] [AC:G69885 ] [PN : processing proteinase homolog ymfH] [GN:ymfH ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85277 : g2634058] [LN : BSUB0009] 
[AC:Z99112:AL009126] [GN:ymfH] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:similar to processing protease] [LE:159305] [RE:160552] [DI:direct] 



ORF Name 



NTID 



AAID 



E 



17503000983 33375260 t2 273 



NT AA 
LengthLength 

165 i m — 



score probability 



Description 
NO-HIT 



199 



ORF Name 



IA17503000983 3394390 tl 48 



Description 
NO-HIT 



NT AA 
NT ID AAID , — , — , . scor e 
L en gth Length 



probability 



] I 553 I I 4325 II 123 ~| I 40 | 



ORF Name 



NTID 



____ NT AA 

AAID — _ — , . score 

Length Length 



[A17503000983 34070251 c3 1010 



][ 



[4326 |prro~[p99 |p^" 



probability 
9.3e-36 



Description 

pir: [LN:A70039] [AC:A70039 ] [PN:ABC transporter (ATP-binding protein) homolog yvfR] 
[GN : yvf R ] [CL : ATP-binding cassette homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell86097:g2635922] [LN:BSUB0018] [AC : Z99121 : AL009126] [GN:yvfR] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
18 of 21): from 3399551to 3609060.] [NT:similar to ABC transporter (ATP-binding 
protein)] [LE:97865] [RE:98770] [DI : complement] >gp : [GI : e313073 : gl945718] [LN : BSZ94043] 
[AC:Z94043] [PN : hypothetical protein] [GN : yvfR] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic DNA fragment (88 kb) . ] [NT:probable ABC 
transporter] [LE: 82901] [RE: 83806] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
Length Length 



[£17503000983 34195135 c2 896 



][ 



[4327 ||TTuT][367 |[T7TF 



probability 
7.8e-126 



Description 

Sp: [LN : THRC_BACSU] [AC:P04990] [GN: THRC] [OR:BACILLUS SUBTILIS] [EC : 4 . 2 . 99 . 2 ] 
[DE : THREONINE SYNTHASE,] [SP:P04990] [DB : swissprot] >pir : [LN : A2 5364] 
[AC:A25364 :B31973 :A69723 ] [PN : threonine synthase, thrC] [GN:thrC ] [CL : threonine 
dehydratase] [OR:Bacillus subtilis] [EC : 4 . 2 . 99 . 2] [DB:pir2] >gp : [GI:g40211] 
[LN: BSTHRBC] [AC:X04603] [PN : threonine synthase] [GN:thrC] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [EC : 4 . 2 . 99 . 2] [DE : B . subtilis thrB and thrC genes for homoserine 
kinase and threoninesynthase (EC 2.7.1.39 and EC 4.2.99.2, respectively).] [SP:P04990] 
[LE:248] [RE:1306] [Dlidirect] >gp : [GI :ell84304:g2 635722] [LN : BSUB0017] 
[AC:Z99120:AL009126] [PN : threonine synthase] [GN:thrC] [FN : threonine biosynthesis] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 4 . 2 . 99 . 2] [DE:Bacillus subtilis complete 
genome (section 17 of 21): from 3197001to 3414420.] [NT : alternate gene name: thrB] 
[SP:P04990] [LE: 115828] [RE: 116886] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — _ — , , score 
Length Length 



probability 



IM7S03000983 34257817 c3 1038 



Description 
NO-HIT 



] 1 | 432fl || 123 || 40 | 



ORF Name 



NT AA 
NTID AAID — , — score 
LengthLength 



probability 



IAI7503000983 34571877 cl 746 



] |557 | [4329 |pT7F] |424 |p^||3.8e-46 



Description 

gp: [GI :g3426364] [LN : AF082738] [AC : AF082738 ] [PN:unknown] [OR : Streptococcus pyogenes] 
[DB:genpept-bct2] [DE : Streptococcus pyogenes phosphotidylglycerophosphate synthase 
(pgsA)and ABC transporter ATP-binding protein (stpA) genes, complete cds;and unknown 

genes.] [NT:orfl] [LE:1] [RE:1245] [DI:direct] 
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NT AA 

ORFJiame NTID AAID, Le ~ thLe ~ th score probability 



Al7503000983_34663177_c2Jj60 | |558 | [ 4330 | [mT||388 |pn~ | p.le-55 



Description 

gp: [GI :gl842440] [LN : BSU87792 ] [AC:U87792] [PN:CinA] [GNrcinA] [FNrputative 
competence -damage inducible function] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis tRNA-Ala, phosphatidylglycerophosphate synthase (pgsA) and CinA 
(cinA) genes, complete cds, and RecA (recA) gene,partial cds . ] [LE:6007] [RE:7258] 
[DI: direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

|&I7S030009a3_3Si87Sa7_c3_i0SS | | [4331 |p3~| |i95 | p7u~|| 4 . 6e-34 

Description 

pir: [LN:S34747] [AC:S34747 ] [PN : anthranilate synthase, component II] [CL rglutamine 
amidotransf erase : trpG homology] [OR: Thermo toga maritima] [EC : 4 . 1 . 3 . 27] [DB:pir2] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



A17503000983_353703i8_c2_834 | |560 | [4332 |pTT" | |309 || 1274 [ |7.4e-130 



Description 

pir: [LN:F69719] [AC:F69719 ] [PN : succinate- -CoA ligase (ADP- forming ) , alpha chain ] 
[GN:sucD ] [CL : succinate- -CoA ligase (ADP- forming) alpha chain] [OR:Bacillus subtilis] 
[EC:6.2.1.5] [DB:pir2] >gp: [GI : ell852 01 : g2633 982] [LN: BSUB0009] [AC : Z99112 : AL009126] 
[PN: succinyl-CoA synthetase (alpha subunit) ] [GN:sucD] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC:6.2.1.5] [DE:Bacillus subtilis complete genome (section 9 of 21): 
from 1598421to 1807200.] [SP:P80865] [LE:82596] [RE:83498] [DI:direct] 
>gp: [GI:e332178:g2462968] [LN : BSYLQGCOD] [AC : AJ000975] [PN:putative succinyl-coA 
synthetase alpha chain] [GN:sucD] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis ylqg to codV gene region.] [SP:P80865] [LE:1764] [RE: 2666] [DI: direct] 



ORF Name 


NTID 


AAID 


NT AA 
Le^thLe^th SCOre Probability 


AI7503000983_35557787_c3_1073 


P n 


| 4333 


1 1275 | 424 ||842 4.4e-84 | 



Description 

gp: [GI:g4982084] [LN : AE001799] [AC : AE001799 : AE000512] [PN : aspartokinase II] [GN:TM1518] 
[OR:Thermotoga maritima] [DB : genpept-bct2] [DE : Thermo toga maritima section 111 of 136 

of the complete genome.] [NT: similar to PID: 928811 SP:P53553 percent identity:] 
[LE : 15047] [RE : 16252] [DI : complement] 

NT AA 

ORF Name NTID AAID — — score probability 

LengthLength c 



Ai:7S03000S83_36i34401_c2_S97 | | [4334 | |fl3± ~] | 276 | |315 || 3.ie-28 



Description 

sp: [LN:YXEH_BACSU] [AC:P54947] [GN : YXEH : IP1B] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
30.2 KD PROTEIN IN IDH-DEOR INTERGENIC REGION] [SP:P54947] [DB : swissprot] 
>pir: [LN:B70075] [AC:B70075 ] [PN : conserved hypothetical protein yxeH] [GN:yxeH ] 
[CL:Methanobacterium thermoautotrophicum conserved hypothetical protein MTH1071] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84680 : g2636501] [LN : BSUB0021] 
[AC:Z99124:AL009126] [GN:yxeH] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 21 of 21): from 3999281to 4214814.] 
[NT: similar to hypothetical proteins] [SP:P54947] [LE : 63194] [RE: 64006] [DI : complement] 
>gp: [GI:dl008920:gl408493] [LN:D45912] [AC:D45912] [GN : yxeH] [OR:Bacillus subtilis] 
[SRrBacillus subtilis (strain:BGSC 1A1 (Marburg 168; trpC2) ) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis genome sequence between the iol and hut operon, partial and 
complete cds.] [NT : homologous to SwissProt : YIDA_ECOLI hypothetical] [LE:7470] [RE: 8282] 
[DI :direct] 
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NT AA 

0RF Name Sim Le ^thLe^th ^^ probability 

[A17503000983_J6142827_cl_814 | | 563 | [ 4335 || 181 T]|603 | |780 || l.fee-7V 

Description 

gp: [GI:dl014255:gl651216] [LN:D88209] [AC:D88209] [PN : Pz-peptidase] [OR:Bacillus 
licheniformis] [SR:Bacillus lichenif ormis (strain:N22) DNA] [DB : genpept - be t 1 ] 
[DE:Bacillus licheniformis DNA for Pz-peptidase, complete cds . ] [LE:238] [RE:2124] 
[DI : direct] 



ORF Name 



NTID 



NT AA 
AAID — , — _ score 
LengthLength 



^7503000983 36220051 c3 964 



probability 
1399 ||4.2e-i43 



Description 

sp: [LN:HSLU_BACSU] [AC:P39778] [GN : HSLU: CLP Y: CODX] [OR:BACILLUS SUBTILIS] [DErHEAT 
SHOCK PROTEIN HSLU] [SP:P39778] [DB : swissprot] >pir : [LN : E69601] 

[AC:E69601:S61495:S72310 ] [PN : ATP -dependent Clp proteinase-like protein clpYrcodX 
protein] [GN : clpY : codX ] [CLrheat shock protein hslU: FtsH/SEC18/CDC48- type ATP-binding 
domain homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g535350] [LN : BSU13634] 
[AC:U13634] [PN:CodX] [GN:codX] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus 
subtilis JH642 dipeptide permease operon regulators, codV,codW, codX, and codY genes, 
complete cds.] [LE:1782] [RE:3185] [DI: direct] >gp : [GI : ell85207 : g2633988] [LN : BSUB0009] 
[AC:Z99112:AL009126] [PN : ATP -dependent Clp protease-like] [GN:clpY] [ORrBacillus 
subtilis] [DB: genpept -bet 1] [DE:Bacillus subtilis complete genome (section 9 of 21) : 
from 1598421to 1807200.] [NT : alternate gene name: hslU, codX] [SP:P39778] [LE:89655] 
[RE: 91058] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — — , , scor e 
LengthLength 



probability 



|A17b03000983 36225938 c2 833 



] I 565 I | 4337 | pre5-|| 394 [ | 1BS 7 || 7.6e-160 



1.5] 



Description 

sp: [LN:SUCC__BACSU] [AC:P80886] [GN:SUCC] [OR:BACILLUS SUBTILIS] [EC:6.2.1.5] 
[DE : (VEGETATIVE PROTEIN 63) (VEG63 ) ] [SP:P80886] [DB : swissprot] >pir : [LN : E69719] 
[AC:E69719 ] [PN : succinate- -CoA ligase (ADP - forming ) , beta chain] [GN:sucC ] 
[CL: succinate- -CoA ligase (ADP-f orming) beta chain] [OR:Bacillus subtilis] [EC:6.2 
[DB:pir2] >gp: [GI :ell85200:g2 633981] [LN : BSUB0009] [AC : Z99112 : AL009126] 
[PN: succinyl-CoA synthetase (beta subunit) ] [GN:sucC] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC:6.2.1.5] [DE:Bacillus subtilis complete genome (section 9 of 21) 
from 1598421to 1807200.] [SP:P80886] [LE: 81410] [RE:82567] [DI: direct] 
>gp: [GI:e332177:g2462967] [LN : BSYLQGCOD] [AC : AJ000975] [PN:putative succinyl-coA 
synthetase beta chain] [GN:sucC] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus 
subtilis ylqg to codV gene region.] [SP:P80886] [LE:578] [RE:1735] [DI:direct] 



ORF Name 



[AI7503000983 3913307 r3 640 



Description 
NO-HIT 



NT AA 
NTID AAID — _ „ — t - score 
LengthLength 



probability 



] 1 1 4338 || 135 || 44 1 



ORF Name 



IAI7503000983 3928177 T2 255 



NTID 



NT AA 
AAID — — score 
LengthLength 



SET" 



4339 11360 I UTS 



probability 
0.00081 



Description 

gp: [GI:dl039115:g4514335] [LN: AB013370] [AC : AB0133 70] [PN : YndE] [GN:yndE] [OR:Bacillus 
halodurans] [SR:Bacillus halodurans (strain : C- 125) DNA] [DB : genpept -bet 1] [DE:Bacillus 
halodurans C-125 yndF, gerKA, yndFl and yndF2 genes , partial and complete cds.] 
[LE:1025] [RE: 1876] [DI: direct] 
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NT AA 

QRF Name NTID AAI5 Le — thLe — th score probability 

Am03000983^3^7bbl_clJ*20 | [568 | [ 4340 | | 993 | [332 | p3 | | 6.0e-80 "| 



Description 

pir : [LN:B70461] [AC:B70461 ] [PN : aspartate- semialdehyde dehydrogenase,] [GN:asd ] 
[CL: aspartate -semi aldehyde dehydrogenase] [ORrAquifex aeolicus] [EC : 1 . 2 . 1 . 11] [DB:pir2] 
>gp: [GI :g2984139] [LN : AE000760] [AC : AE000760 : AE000657] [PN : aspartate- semialdehyde 
dehydrogenase] [GN:asd] [OR:Aquifex aeolicus] [DB : genpep t - be 1 2 ] [DE:Aquifex aeolicus 
section 92 of 109 of the complete genome.] [LE:7783] [RE: 8805] [DI:direct] 

NT AA 

QRF Name NTID AAID — — , , score probability 
Length Length c 



M:7S030005fl3_3Sgi702_ci_flli | [563 | [4341 ||12gg"||422 | [2245 ||3 . 5e-233 



Description 

pir: [LN: JC5325] [AC : JC5325 : PC4317 ] [PN : methicillin resistance factor FEMA] [GN:femA ] 
[CL:methicillin resistance factor femA] [OR : Staphylococcus epidermidis] [DB:pir2] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000983_4023;303_c2_923 | [570 | [4342 ||7g2~| [263 ||450 ||i.ge-42 



Description 

sp: [LN:TRPC_LACLA] [AC:Q01999] [GN : TRPC] [OR : LACTOCOCCUS LACTIS] 

[SR: ,SUBSPLACTIS: STREPTOCOCCUS LACTIS] [EC : 4 . 1 . 1 . 48] [DE : INDOLE- 3 -GLYCEROL PHOSPHATE 
SYNTHASE, (IGPS) ] [SP:Q01999] [DB : swissprot] >pir: [LN:S35127] [AC:S35127 ] 
[PN : indole- 3 -glycerol -phosphate synthase,] [GN:trpC ] [CL : indole- 3 -glycerol -phosphate 
synthase: trpC homology] [OR : Lactococcus lactis subsp. lactis] [EC : 4 . 1 . 1 . 48] [DB:pir2] 
>gp: [GI :gl49519] [LN: LACTRPOP] [AC:M87483] [PN: indoleglycerol phosphate synthase] 
[GN:trpC] [OR: Lactococcus lactis] [SR: Lactococcus lactis (strain IL1403, sub_species 
lactis) DNA] [DB :genpept-bctl] [EC : 4 . 1 . 1 . 48] [DE : L . lactis trpE, trpG, trpD, trpF, 
trpC, trpB trpA genes, completecds . ] [LE:4089] [RE: 4883] [DI:direct] 

NT AA 

QRF Name NTID AAID — _„ — , , score probability 
Length Length L - 



AI7503000983_40g2762_r3_bbb | [571 | [4343 [ | 405 ~| [134 | [STT5 — || i . le-i ^ 



Description 

gp: [GI:g3135292] [LN : AF029731] [AC : AF029731] [PN:large conductance mechanosensitive 
channel] [GN:mscL] [OR : Staphylococcus aureus] [DB : genpept -bct2 ] [DE : Staphylococcus 
aureus large conductance mechanosensitive channel (mscL) gene, complete cds . ] [NT :MscL] 
[LE:49] [RE:411] [DI:direct] 



NT AA 

QRF Name NTID AAID — , — , score probability 
Length Length 

AI7g03000983_4063802_ci_808 | [572 | [ 4344 | pTT| |498 | p3~ || i . 8e-80 



Description 

sp : [LN : TRPE_LACLA] [AC:Q02001] [GN : TRPE] [OR : LACTOCOCCUS LACTIS] 

[SR: , SUBSPLACTIS : STREPTOCOCCUS LACTIS] [EC : 4 . 1 . 3 . 2 7] [DE : ANTHRANILATE SYNTHASE 
COMPONENT I,] [SP:Q02001] [DB : swissprot] >pir : [LN: S35124] [AC:S35124 ] [PN : anthranilate 
synthase, alpha chain] [GN : trpE ] [CL : anthranilate synthase component I] 
[OR: Lactococcus lactis subsp. lactis] [EC : 4 . 1 . 3 . 27] [DB:pir2] >gp : [GI :gl49516] 
[LN : LACTRPOP] [AC : M87483] [PN : anthranilate synthase alpha subunit] [GN: trpE] 
[OR: Lactococcus lactis] [SR : Lactococcus lactis (strain IL1403, sub_species lactis) DNA] 
[DB:genpept-bctl] [EC : 4 . 1 . 3 . 27] [DE : L . lactis trpE, trpG, trpD, trpF, trpC, trpB trpA 
genes, completecds.] [LE:954] [RE: 2324] [DI: direct] 
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ORF Name 



NTID 



^ m NT AA 

AAID — _ — .. score 

LengthLength 



probability 



IA17b03000983 40686 el 73 7 



Description 

sp: [LN:DP03_STAAU] [AC : Q53665 : Q57110] [GN:POLC] [OR : STAPHYLOCOCCUS AUREUS] [EC:2.7.7.7] 
[DE : DNA POLYMERASE III, ALPHA CHAIN POLC-TYPE, (POLIII) ] [SP : Q53 665 : Q57110] 
[DBiSwissprot] >gp : [GI : dl013849 : gl483182] [LN:D86727] [AC : D86727 :D45368] [ PN : DNA 

polymerase III] [OR : Staphylococcus aureus] [SR: Staphylococcus aureus (strain :IP8) DNA, 

clone :pBpolC] [DB :genpept-bctl] [DE : Staphylococcus aureus DNA for DNA polymerase III, 

complete cds.] [LE:34] [RE: 4341] [DI: direct] 



ORF Name 



IAI7503000983 4059675 t2 274 



Description 
NO-HIT 



NTID 



][ 



AAID 



NT 



AA 



LengthLength 



score probability 



[4346 |[X£T~| p~ 



ORF Name 



NTID 



^ TrN NT AA 

AAID — — , , score 

LengthLength 



probability 



IA17503000983 4080342 c3 996 



][ 



[4347 ||1515||504 | [T5W]|1 . 4e-206 



Description 

sp: [LN:GLPK_BACSU] [AC:P18157] [GN : GLPK] [OR:BACILLUS SUBTILIS] [EC : 2 . 7 . 1 . 30] 
[DE: (GLYCEROKINASE) (GK) ] [SP:P18157] [DB : swissprot] >pir : [LN : B4 586 8 ] 
[AC:B45868:D47700:C69634:S18564 ] [PN:glycerol kinase, glpK] [GNrglpK ] 
[CL:xylulokinase] [OR:Bacillus subtilis] [EC : 2 . 7 . 1 . 30] [DB:pir2] [MP:75 (degrees) ] 
>gp: [GI:gl42992] [LN : BACGLPKD] [AC:M34393] [OR:Bacillus subtilis] [SR : B . subt ilis 
(strain W168) DNA] [DB :genpept-bctl] [DE :B . subtilis glycerol kinase (glpK) and 

glycerol- 3 -phosphatedehydrogenase (glpD) genes, complete cds.] [NT:glycerol kinase 
(glpK) (EC 2.7.1.30) ] [LE:698] [RE:2188] [DI : direct] >gp : [GI : ell82918 :g2633252] 
[LN:BSUB0005] [AC : Z99108 : AL009126] [PN:glycerol kinase] [GN : glpK] [FN:glycerol 

utilization] [OR: Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 7 . 1 . 30] [DE: Bacillus 

subtilis complete genome (section 5 of 21): from 802821 tol011250.] [SP:P18157] 
[LE:200029] [RE:201519] [DI:direct] >gp : [GI :ell82930:g2 633264] [LN : BSUB0006 ] 
[AC:Z99109:AL009126] [PN:glycerol kinase] [GN : glpK] [FN:glycerol utilization] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 7 . 1 . 30] [DE:Bacillus subtilis complete 

genome (section 6 of 21): from 999501 tol209940.] [SP:P18157] [LE:3349] [RE:4839] 
[DIrdirect] >gp : [GI : e324941 :g2226137] [LN: BSY14079] [AC:Y14079] [PN:glycerol kinase] 
[GN: glpK] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis chromosomal 

DNA, region 75 degrees: glpPFKDoperon and downstream.] [NT: see EMBL M34393 and Swiss 

Prot P18157.] [SP:P18157] [LE:2997] [RE:4487] [DIrdirect] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



IAI7503000983 4089062 c2 906 



][ 



[4348 |p77^"[|391 [|380 | jT 



probability 
.Oe-35 



Description 

pir : [LN:C71302] [AC:C71302 ] [PN:probable exonuclease] [GN:TP0626 ] [OR : Treponema 
pallidum subsp. pallidum] [SR:, syphilis spirochete] [DB:pir2] >gp : [GI : g3322921] 
[LN:AE001237] [AC : AE001237 : AE000520] [PN : exonuclease , putative] [GN:TP0626] 
[OR:Treponema pallidum] [DB :genpept-bct2] [DE : Treponema pallidum section 53 of 87 of 
the complete genome.] [NT: similar to SP:P23479 percent identity: 32.68/] [LE: 11246] 
[RE: 12421] [DI:direct] 
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ORF Name 



NT AA 
NT ID AAID — , _ — score 
LengthLength 



probability 



IA175030009BJ c2 859 



"| |577 | [4349 ||b9i~||i96 | ps~]| 4 . 7e-4i 



Description 

sp: [LN:PGSA_BACSU] [AC:P46322] [GN : PGSA] [OR : BACILLUS SUBTILIS] [EC:2.7.8.5] [DE:(EC 
2.7.8.5) (PHOSPHATIDYLGLYCEROPHOSPHATE SYNTHASE) (PGP SYNTHASE)] [SP:P46322] 
[DB:swissprot] >gp: [GI : dl009402 : g893358] [LN : BACPGS1A] [AC:D50064] [PN:PgsA] [GNipgslA] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : Marburgl6 8 ) DNA] [DB : genpept-bctl] 
[EC: 2. 7. 8. 5] [DE: Bacillus subtilis pgslA gene for phosphatidylglycerophosphatesynthase , 
complete cds.] [LE:182] [RE:763] [Dlrdirect] >gp: [GI :gl842439] [LN : BSU87792] 
[AC:U87792] [PN : phosphatidylglycerophosphate synthase] [GNrpgsA] [FN: involved in the 
synthetic pathway for acidic] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis tRNA-Ala, phosphatidylglycerophosphate synthase (pgsA) and CinA (cinA) genes, 
complete cds, and RecA (recA) gene, partial cds.] [NT: PgsA] [LE:5407] [RE: 5990] 
[DI : direct] 



ORF Name 



NT ID 



™™ NT AA 

AAID — . T — , , score 

LengthLength 



probability 



3000983 4095286 c2 869 



] I 576 | | 4350 IP^~|| 124 IP^~I F 



.4e-2l 



Description 

pir: [LN:C69884] [AC:C69884 ] [PN : conserved hypothetical protein ymcA] [GN : ymcA ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85293 :g2634074] [LN: BSUB0009] 
[AC:Z99112:AL009126] [GN : ymcA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:similar to hypothetical proteins] [LE:175322] [RE:175753] [Dlrdirect] 



ORF Name 



NT AA 

NT ID AAID — ~ — , , scor e 

LengthLength 



probability 



IM7503000983 4334383 E2 322 



] |579 | |435i | | 135 "| | 44 | 



Description 
NO-HIT 



ORF Name 



NT ID 



7V7VTT . NT AA 

AAID — . — . , score 

LengthLength 



1A17503000983 4336536 cl 739 



|4352 ||315 ||104 | pTl 



probability 
i.3e-17 



Description 

Sp : [LN : YLXR_BACSU] [AC:P32728] [GN : YLXR] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 10.4 

KD PROTEIN IN NUSA-INFB INTERGENIC REGION (ORF3)] [SP:P32728] [DB : swissprot] 
>pir: [LN:D36905] [AC : D36905 : D69882 : S31992 ] [PN: conserved hypothetical protein 

ylxR: hypothetical protein 1 (nusA 3' region)] [GN : ylxR ] [OR:Bacillus subtilis] 
[DB:pir2] >gp : [GI :g580900] [LN : BSORF1T7A] [AC:Z18631] [GN:ORF3] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE :B. subtilis infB-nusA operon.] [SP:P32728] [LE:2090] [RE:2365] 
[DI:direct] >gp: [GI :ell85252 :g2 634033] [LN: BSUB0009] [AC : Z99112 : AL009126] [GN :ylxR] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 9 of 21): from 1598421to 1807200.] [NT : alternate gene name: ymxB; 

similar to hypothetical] [SP:P32728] [LE:134381] [RE:134656] [Dlrdirect] 



ORF Name 



AI7503000983 4414675 Cl 783 



Description 
NO-HIT 



NTID 

dee: 



^ TT% NT AA 

AAID — — t , score 

LengthLength 

[4353 ||132 | |43 | 



probability 
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ORF Name 



NT AA 
NT ID AAID — , « _ — sco re 
Length Length 



probability 



IAI7503000983 4425068 i2 474 



[582 | [4354 ||2613 | |B70 ||880 ||4.2e-85" 



Description 

pir: [LN:G69801] [AC:G69801 ] [PN : hypothetical protein yfhO] [GNryfhO ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell82850 :g2633184] [LN: BSUB0005] [AC : Z99108 : AL009126] 

[GN:yfhO] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis 
complete genome (section 5 of 21): from 802821 tol011250.] [LE:128691] [RE:131150] 

[DI: direct] >gp: [GI : dl02 5397 : g2 8 04545] [LN:D85082] [AC:D85082] [PN:YfhO] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis DNA] [DB : genpept -bctl] [DE:Bacillus subtilis DNA, 
genome sequence, 79 to 81 degree region.] [LE: 21582] [RE: 24041] [DI: direct] 



ORF Name 



AI7503000983 4460063 £1 150 



Description 
NO-HIT 



ORF Name 



AI7503000983 44/326 C2 871 



Description 
NO-HIT 



NT ID 



NT AA 
AAID — , — , , score 
LengthLength 



probability 



J |583 | [4355 ||123 1 140 | 



NT AA 
NT ID AAID — — , , score 
LengthLength 

|584 | [4356 |[T55~[ |51 | 



probability 



ORF Name 



F 



17503000983 4493778 c3 952 



Description 
NO-HIT 



ORF Name 



117503000983 4572152 c3 970 



NT AA 

NTID AAID — _ — score 

LengthLength 



probability 



NTID 



^ TT ^ NT AA 
AAID — _ — , , score 
LengthLength 



probability 
] I 586 I | 4358 ||T2W||429 | |932 ||1.3e-93 



Description 

pir: [LN:C69881] [AC:C69881 ] [PN : conserved hypothetical protein yluC] [GN:yluC ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85247 : g2634028] [LN : BSUB0009] 
[AC:Z99112 :AL009126] [GN:yluC] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT: similar to hypothetical proteins] [LE: 125000] [RE: 126268] [DI: direct] 



ORF Name 



IAI7503000983 4687825 c3 955 



Description 
NO-HIT 



NTID 



][ 



NT AA 

AAID , , 

LengthLength 

[4359 ||155 | [54 I 



score 



probability 



ORF Name 



E 



17503000983 4719011 c2 841 



Description 
NO-HIT 



NTID AAID — , — , score 
LengthLength 



probability 



"] [588 | [4350 |[XT^~| [57 



ORF Name 



AJ7503000983_4740932_ci_825 

Description 

NO-HIT 



NTID AAID — , — , score 
Length Length 



probability 



] 1 | 4361 H 126 | | 41 | 



206 



NT AA 

0RF Name *™ AAIP Le ^thLe^th ^^ probability 



AI7B030009B3_479B4I>3_cl_790 | [ 590 | [4362 | | 1026 | [341 | p?2~|| 2 . 7e-2 7 



Description 

pir: [LN:H69873] [AC:H69873 ] [PN : conserved hypothetical protein ylbC] [GNrylbC ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e334771 :g2339999] [LN :BS16823KB] [AC:Z98682] 
[PNrYlbC protein] [GNrylbC] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus 
subtilis genomic DNA 23.9kB fragment.] [LE: 11510] [RE: 12550] [DI: direct] 
>gp: [GI : el 185086 :g2 633867] [LN : BSUB0008] [AC : Z99111 : AL009126] [GN:ylbC] [FNiunknown] 
[ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21) : from 1394791to 1603020.] [NTrsimilar to hypothetical proteins from B. 
subtilis] [LE:170993] [RE:172033] [DI: direct] 



NT AA 

ORF Name NTID AAID — — , , score proba bility 
LengthLength c -L 



|AI7503000983 4884675 c3 1057 1 |55I ] [ 4363 |pTT] [ 403 | p6fl | | 3 -2e-129 



Description 

sp: [LN : TRPB_LACLA] [AC:Q01998] [GN: TRPB] [OR : LACTOCOCCUS LACTIS] 

[SR: , SUBSPLACTIS : STREPTOCOCCUS LACTIS] [EC : 4 . 2 . 1 . 20] [DE : TRYPTOPHAN SYNTHASE BETA 
CHAIN,] [SP:Q01998] [DB : swissprot] >pir : [LN : S35129] [AC:S35129 ] [PN : tryptophan 
synthase, beta chain] [GN: trpB ] [CL : tryptophan synthase beta chain : tryptophan synthase 
beta chain homology] [OR : Lactococcus lactis subsp. lactis] [EC : 4 . 2 . 1 . 20] [DB:pir2] 
>gp: [GI :gl49521] [LN: LACTRPOP] [AC:M87483] [PN : tryptophan synthase beta subunit] 
[GN : trpB] [OR: Lactococcus lactis] [SR : Lactococcus lactis (strain IL1403, sub_species 
lactis) DNA] [DB : genpept-bctl] [EC : 4 . 2 . 1 . 20] [DE : L . lactis trpE, trpG, trpD, trpF, 
trpC, trpB trpA genes, completecds . ] [LE:6514] [RE:7722] [DI:direct] 



NT AA 

ORF Name NTID AAID — _„ — , , score probability 

LengthLength 



AI7503000983_4890802_r 2_27S | |592 [ [4364 |p32"| [ 383 [ [ 819 || 1.2e-81 



Description 

sp: [LN:TYRA_BACSU] [AC:P20692] [GN : TYRA] [OR:BACILLUS SUBTILIS] [EC : 1 . 3 . 1 . 12] 
[DE : PREPHENATE DEHYDROGENASE, (PDH) ] [SP:P20692] [DB : swissprot ] 



NT AA 

ORF Name NTID AAID — _ — , , score probabi lity 
LengthLength c L - 



[AI7503000983_4891577_c2_879 | [593 | [ 4365 | | 582 "] |193 | p3~j| 7 . 4e-43 

Description 

pir: [LN:G69657] [AC:G69657 ] [ PN : tRNA isopentenylpyrophosphate transferase miaA] 
[GN:miaA ] [CL : delta (2) -isopentenylpyrophosphate transferase] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : ell83392 :g2634117] [LN : BSUB0010] [AC : Z 99113 : AL009126] [PN : tRNA 
isopentenylpyrophosphate transferase] [GN:miaA] [OR: Bacillus subtilis] 
[DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 10 of 21) : from 
1781201to 2014980.] [LE:84430] [RE:85374] [DI:direct] 



NT AA 

ORF Name NTID AAID — _ T — , , score probabi lity 
LengthLength *- 



AI7503000983_4942202_c3_1058 | [594 | [4366 p 



Description 
NO-HIT 



207 



NT AA 

QRF Name 52™ AAIP Le — thLe — th score probability 



[AI75030005Bi_49fe4fe8fe_tJ>20 | | 595 | [436 7 | |204 |p || 195 || 1.6e-lb | 

Description 

pir: [LN:C70057] [AC:C70057 ] [PN : 4 -oxalocrotonate tautomerase homolog ywhB] [GNrywhB ] 
[CL:4-oxalocrotonate tautomerase] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:e267624 :gl565237] [LN:BSTHRZ] [AC:Z80360] [PN:Unknown, highly similar to 
Pseudomonas putida] [GN : ywhB] [OR: Bacillus subtilis] [DB : genpept -bet 1 ] [DE : B . subtilis 
thrZ downstream chromosomal region.] [LE:2388] [RE: 2576] [DI : complement] 
>gp: [GI:ell86254:g2636290] [LN: BSUB0020] [AC : Z99123 : AL009126] [GN: ywhB] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete genome (section 
20 of 21): from 3798401to 4010550.] [NT:similar to 4 -oxalocrotonate tautomerase] 
[LE:54138] [RE:54326] [DIrdirect] 

NT AA 

QRF Name NTID AAID — — , _ score probability 
LengthLength c *- 



AlV!>030009fl3_B10937fl_t3_S46 | | 596 | [ 4358 || 6i5 — 1 1 204 |pT~ 



i.Jie-44 



Description 

pir: [LN:A69892] [AC:A69892 ] [PN : conserved hypothetical protein yneS] [GN:yneS ] 
[CL: Escherichia coli ygiH protein] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:e249655:gl405459] [LN : BC170DEGR] [AC:Z73234] [PN:YneS] [GN:yneS] [OR:Bacillus 
subtilis] [DB: genpept -bet 1] [DE :B . subtilis DNA (26.2 kb fragment; 170 degree region).] 
[NT: similar to hypothetical protein MG247 from] [LE : 13596] [RE: 14177] [DI : complement] 
>gp: [GI:ell83465:g2634190] [LN : BSUB0010] [AC : Z99113 : AL009126] [GN:yneS] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis complete genome (section 
10 of 21): from 1781201to 2014980.] [NT:similar to hypothetical proteins] [LE:149975] 
[RE : 150556] [DI : complement] 



NT AA 

QRF Name NTID AAID — — , , score probability 
LengthLength c 



AI7503000983_Bi0952S_c3_10il | |597 | [ 4369 |p^~ 1 1 123 | p5B — 1 |3 . 4e-22 



Description 

pir: [LN:D70039] [AC:D70039 ] [PN : two -component response regulator [YvfT] homolog yvfU] 
[GN:yvfU ] [CL: regulatory protein comA: response regulator homology] [OR: Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell86094 :g2635919] [LN : BSUB0018] [AC : Z99121 : AL009126] 
[GN:yvfU] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis 
complete genome (section 18 of 21): from 3399551to 3609060.] [NT:similar to 
two-component response regulator [YvfT]] [LE: 95389] [RE: 95991] [D I : complement] 
>gp: [GI:e313075:gl945721] [LN:BSZ94043] [AC:Z94043] [PN : hypothetical protein] [GN:yvfU] 
[OR:Bacillus subtilis] [DB : genpept -bctl] [DE : B . subtilis genomic DNA fragment (88 kb) . ] 
[NT:probable two component regulatory system:] [LE: 85680] [RE: 86282] [DI:direct] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



AI7S03000983_SiS0fi3S_c2_903 [ |598 | [4370 |pu~ [ | 79 |[7 6 || 0.02g 



Description 

sp: [LN:F801_SCHMA] [AC:P16463] [OR : SCHISTOSOMA MANSONI] [SR:, BLOOD FLUKE] [DE : FEMALE 
SPECIFIC 800 PROTEIN (FS800) ] [SP:P16463] [DB : swissprot] >gp : [GI : gl60990] [LN : SCMFS800] 
[AC:J03999] [PN : female- specif ic 800 protein] [GN:fs800] [OR : Schistosoma mansoni] 
[SR: Schistosoma mansoni (strain Puerto Rican) cDNA to mRNA] [DB : genpept -invl] 
[DE: Schistosoma mansoni female-specific 800 protein (fs800) mRNA, complete cds . ] 
[NT:putative] [LE:4] [RE:720] [DIrdirect] 



208 



NT AA 

QRF Name NTID AAID Le — thLe — th score probability 



Ar 7 503000983_bl9bJ28_ciJ 7 49 | | 599 | [ 4371 || 393 1 1 130 || 142 | p.3e- W 



Description 

gp: [GI :gl842438] [LN : BSU87792 ] [AC:U87792] [PNrunknown] [ORrBacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis tRNA-Ala, phosphatidylglycerophosphate 
synthase (pgsA) and CinA (cinA) genes, complete cds, and RecA (recA) gene, partial cds . ] 
[NT:ORF307; hypothetical 34.7 kd protein] [LE:4436] [RE:5359] [Dlidirect] 

NT AA 

QRF Name NTID AAID Le ^ t fr Le ^ th score probability 



^7503000983 5198557 ci 734 



[4372 |pn~|pgg ||847 ||TTTe^r 



Description 

gp: [GI:dl032955:g3358087] [LN: AB004319] [AC : AB004319] [PN : undecaprenyl diphosphate 
synthase] [GN:upps] [OR micrococcus luteus] [SR micrococcus luteus (strain:B-P 26) DNA] 
[DB: genpept-bctl] [DE micrococcus luteus DNA for undecaprenyl diphosphate 
synthase, complete cds.] [LE:1] [RE: 750] [DI:direct] 

NT AA 

QRF Name NTID AAID — ^ w — ^ score probability 
Length Length c 



IAI7503000983. 5212776 c2 832 J J601 | [43 7 3 |p7^[ | 288 | p39 | | 9.2e-84 



Description 

pir: [LN:F69880] [AC:F69880 ] [PN : conserved hypothetical protein ylqF] [GN:ylqF ] 
[CL: conserved hypothetical protein MG442] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85196:g2633977] [LN : BSUB0009] [AC : Z 9 9112 : AL009126] [GN:ylqF] [FN:unknovm] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to hypothetical proteins] [LE:77511] 
[RE: 78359] [DI: direct] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



IAI7503000983 5355012 cl 770 



4374 111359 I IT52 112274 ||7.9e-236 



Description 

sp: [LN:GLNA_STAAU] [AC:Q59812] [GN : GLNA] [OR : STAPHYLOCOCCUS AUREUS] [EC:6.3.1.2] 
[DE : GLUT AMINE SYNTHETASE, ( GLUT AMATE- -AMMONIA LIGASE) (GS) ] [SP:Q59812] [DB : swissprot] 
>gp: [GI:e214721:gll34886] [LN : SAGLNAR] [AC:X76490] [PN:glutamine synthetase] [GN:glnA] 
[OR staphylococcus aureus] [DB : genpept-bctl] [DE:S. aureus (bb270) glnA and glnR genes.] 
[SP:Q59812] [LE:1362] [RE:2702] [DI: direct] 

NT AA 

QRF Name NTID AAID — _ , — , , score probabi lity 
LengthLength c 



|AI7503000983_553455_t3_540 

Description 
NO-HIT 



603 | [4375 ||I53~||S0 | 



209 



NT AA 

ORFJTame NTID AAID ^^^-^ score probability 



[AI 7 5030009BJ>^babtm_cl_B02 | | b04 | [43 7 6 |p7U"] | 489 ||I3P5"] |3 ■ 8e-133 



Description 

sp: [LN:ALST_BACSU] [AC:Q45068] [GN: ALST] [OR: BACILLUS SUBTILIS] [DE: AMINO ACID CARRIER 
PROTEIN ALST] [SP:Q45068] [DB : swissprot] >pir : [LN: A69585] [AC:A69585] [PN: amino acid 
carrier protein alsT] [GN : alsT ] [CL : sodium-dependent D-alanine/glycine transport 
protein] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e249660 : gl405464] [LN : BC170DEGR] 
[AC:Z73234] [PN:AlsT] [GN: alsT] [FN : aminoacid carrier protein] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE : B . subtilis DNA (26.2 kb fragment; 170 degree region).] 
[NT: similar to sodium/proton dependent alanine carrier] [SP:Q45068] [LE: 20601] 
[RE:21998] [DI:direct] >gp : [GI :ell83470:g2 634195] [LN : BSUB0010] [AC : Z99113 : AL009126] 
[PN:amino acid carrier protein] [GN : alsT] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 10 of 21): from 1781201to 2014980.] 
[SP:Q45068] [LE:156980] [RE:158377] [DI:direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|A:t7503000983j541gggg_ci_800 | [505 | [4377 |pHT| p6 |pTFTT| |0.Q 

Description 

sp : [LN : PARE_STAAU] [AC:P50072] [GN : PARE : GRLB] [OR : STAPHYLOCOCCUS AUREUS] [EC : 5 . 99 . 1 . - ] 
[DE:TOPOISOMERASE IV SUBUNIT B, ] [SP:P50072] [DB : swissprot] >pir: [LN:S54426] [AC:S54426 

] [ PN : DNA topoisomerase ( ATP -hydroly zing) , chain B] [CL :DNA topoisomerase 
(ATP-hydrolyzing) chain B] [OR : Staphylococcus aureus] [EC : 5 . 99 . 1 . 3] [DB:pir2] 

>gp: [GI:dl011746 :gl777320] [LN:D67075] [AC:D67075] [PN: DNA topoisomerase IV GrlB 

subunit] [GN : grlB] [OR : Staphylococcus aureus] [SR: Staphylococcus aureus 
(isolate :RN4220) DNA] [DB : genpept-bctl] [DE : Staphylococcus aureus DNA for DNA 

topoisomerase IV GrlB subunit, DNA topoisomerase IV GrlA subunit, complete cds . ] 
[LE:385] [RE:2376] [DI:direct] >gp : [GI :g561879] [LN: STAGYRASL] [AC:L25288] 
[PN:gyrase-like protein beta subunit] [GN :grlB] [OR : Staphylococcus aureus] 
[SR: Staphylococcus aureus (tissue library: FDA 574) DNA] [DB : genpept-bctl] 
[DE: Staphylococcus aureus gyrase-like protein alpha and beta subunit (grlA and grlB) 

genes, complete cds.] [LE:41] [RE:2032] [DI:direct] >gp : [GI : e306312 : g2302281] 
[LN:A48501] [AC:A48501] [OR : Staphylococcus aureus] [DB : genpept-pat] [DE : Sequence 3 from 

Patent WO9603516.] [NT:unnamed protein product] [LE:1] [RE: 1992] [DI: direct] 



ORF Name 


NT AA 

NTID AAID L — — th score probability 


AI7503000983_6S25_t3_S77 


| | |4378 ||1035||344 |p9 ||4.2e-72 



Description 

sp: [LN:LYSP_ECOLI] [AC:P25737] [GN : LYSP : CADR] [OR : ESCHERICHIA COLI] [DE : LYSINE-SPECIFIC 
PERMEASE] [SP : P25737] [DB : swissprot] 



210 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



IA17503000983 6641963 c3 1009 



] | 507 1 | 4375 ||i48b||454 ||1J02 ||8.0e-133 



] 



Description 

sp: [LN:YWNE_BACSU] [AC:P71040] [GN : YWNE] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 55.8 
KD PROTEIN IN SPOIIQ-MTA INTERGENIC REGION] [SP:P71040] [DB : swissprot] >pir : [LN:G70063] 
[AC:G70063 ] [PN : cardiolipin synthase homolog ywnE] [GNrywnE ] [CL: Bacillus probable 
cardiolipin synthetase] [ORrBacillus subtilis] [DB:pir2] >gp r [GI r ell84565 r g2636184] 
[LN:BSUB0019] [AC r Z99122 r AL009126] [GNrywnE] [FNrunknown] [ORrBacillus subtilis] 
[DB:genpept-bctl] [DErBacillus subtilis complete genome (section 19 of 21) : from 
3597091to 3809700.] [NTrsimilar to cardiolipin synthase] [SP:P71040] [LE:164628] 
[RE:166076] [DI: direct] >gp : [GI : e269549 :gl592701] [LN : BSUEROP] [AC:Y08559] [PN:Unknown] 
[GNrywnE] [ORrBacillus subtilis] [DB rgenpept-bctl] [DE : B . subtilis urease operon and 
downstream DNA. ] [NT : Product similar to Escherichia coli cardiolipin] [SPrP71040] 
[LE:5155] [REr6603] [DI r complement] >gp: [GI r ell84565 r g2636184] [LNrBSUB0019] 
[ACrZ99122 rAL009126] [GNrywnE] [FNrunknown] [ORrBacillus subtilis] [DBrgenpept] 
[DErBacillus subtilis complete genome (section 19 of 21) r from 3597091to 3809700.] 
[NTrsimilar to cardiolipin synthase] [SPrP71040] [LErl64628] [RErl66076] [DIrdirect] 



ORF Name 



IAI7503000983 6S64127 c2 940 



Description 
NO-HIT 



„ mTr , NT AA 

NTID AAID — _^ — . score 

LengthLength 



probability 



j |608 | [4380 ||438 | |145 | 



ORF Name 



NTID 



NT 

AAID . 



AA 



LengthLength 



score 



I&I7S03000 983 6578140 c3 972 



probability 
3.8e-28 



1 



Description 

pirr [LNrS52267] [ACrS52267 ] [ PN r DNA polymerase III] [CL r DNA-directed DNA polymerase 
III alpha chain polC] [OR r Staphylococcus aureus] [DBrpir2] >gp r [GI r g642270] 
[LN r SADNAPOL3 ] [AC : Z48003 r L39156] [ PN r DNA polymerase III] [OR r Staphylococcus aureus] 
[DB rgenpept-bctl] [DErS. aureus gene for DNA polymerase III.] [SPrQ53665] [LE r 34] 
[REr4281] [DI rdirect] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



— , score probability 



[AI7503000983 6688125 cl 751 



[4382 ||107^][3B7 | pSTj|7 . 2e-171 



Description 

Sp r [LN r RECA_STAAU] [ACrQ023 50] [GNrRECA] [OR r STAPHYLOCOCCUS AUREUS] [DErRECA PROTEIN] 
[SPrQ02350] [DB r swissprot] >gp r [GI rg463285] [LN r STARECAA] [ACrL25893] [GNrrecA] 
[FNrgenetic recombination] [OR r Staphylococcus aureus] [SR r Staphylococcus aureus DNA] 
[DB rgenpept-bctl] [DE r Staphylococcus aureus recA gene, complete cds.] [NT r putative] 
[LE r 16] [RErl059] [DIrdirect] 



ORF Name 



_ ___ _ NT AA 

NTID AAID — _„ — , , score 

LengthLength 



probability 



AI7503000983_673437_i3_S29 

Description 

NO-HIT 



211 



NT AA 

ORF Name NTID AAID Le ^ t h Le ^ t ft scor e probability 



AI7S03000983_681BB27_c3_1072 | |g!2 | [ 4384 | | 160B | |535 | [2088 || 4 . ie- TTS" 



Description 

pir : [LN:E69861] [AC:E69861 ] [PN:ABC transporter (ATP-binding protein) homolog ykpA] 
[GN : ykpA ] [CL : ATP-binding cassette homology] [ORrBacillus subtilis] [DB:pir2] 

>gp: [GI:ell85033:g2633814] [LN : BSUB0008] [AC : Z99111 : AL009126 ] [GNrykpA] [FN:unknovm] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete genome (section 

8 of 21): from 1394791to 1603020.] [NT:similar to ABC transporter (ATP-binding 

protein)] [LE:116988] [RE:118610] [DI: direct] >gp : [GI :g3282128] [LN: AF012285] 
[AC:AF012285:AF012284:U51911] [ PN : YkpA] [GNrykpA] [ORrBacillus subtilis] 
[DB r genpept-bct2] [DErBacillus subtilis mobA-nprE gene region.] [NTrsimilar to E. coli 

hypothetical ABC transporter] [LE r 17476] [RE r 19098] [DIrdirect] 



NT AA 

ORF Name NTID AAI D — , — , score probability 
Length Length L - 



A17b030009B3_fcB37B12_cl_73B | |613 | |438b |(7^~| |261 | pi | | 1.3e-59 ~| 



Description 

sp r [LN r CDSA_BACSU] [ACr031752] [GNrCDSA] [OR r BACILLUS SUBTILIS] [EC r 2 . 7 . 7 . 41] 
[DE r SYNTHASE ) ] [SP:031752] [DB : swissprot] >pir : [LN : G69597] [AC:G69597 ] 
[PNrphosphatidate cytidylyltransf erase cdsA] [GNrcdsA ] [ORrBacillus subtilis] 
[DBrpir2] >gpr [GI rell85245rg2 634026] [LN r BSUB0009] [AC r Z99112 r AL009126] 
[PNrphosphatidate cytidylyltransf erase] [GNrcdsA] [FN r phospholipid biosynthesis] 
[ORrBacillus subtilis] [DB r genpept-bctl] [EC r 2 . 7 . 7 . 41] [DErBacillus subtilis complete 
genome (section 9 of 21) r from 1598421to 1807200.] [SPr031752] [LErl22972] [RErl23781] 
[DI r direct] 

NT AA 

ORF Name NTID AAID — _ — , sco re probability 

LengthLength -L 



AI7S030009B3_6929552_ci_B22 | |g!4 | [ 4386 |[7^T"^ |247 | pi~ || 3 . le-6 7 



Description 

pir r [LNrF69866] [ACrF69866 ] [PN r tetrahydrodipicolinate succinylase homolog ykuQ] 
[GNrykuQ ] [ORrBacillus subtilis] [DBrpir2] >gp r [GI r ell81922 rg2632238] [LN r BS16829KB] 
[ACrAJ222587] [PN r YkuQ protein] [GNrykuQ] [ORrBacillus subtilis] [DB r genpept-bctl] 
[DErBacillus subtilis 29kB DNA fragment from ykwC gene to csel5 gene.] [NT r homologous 
to acetyltransferases] [LEr23332] [REr24042] [DIrdirect] >gp r [GI r ell85008 rg2633789] 
[LNrBSUB0008] [AC r Z99111 r AL009126] [GNrykuQ] [FNrunknown] [ORrBacillus subtilis] 
[DBr genpept-bctl] [DErBacillus subtilis complete genome (section 8 of 21) r from 
1394791to 1603020.] [NTrsimilar to tetrahydrodipicolinate succinylase] [LE r 93588] 
[RE r 94298] [DI r direct] 

NT AA 

ORF Name NTID AAID — — , . score probabilit y 
LengthLength c L - 



|AI7S030005fl3_g325g77_ci_7gg | |gi5 | [4387 |85 || 187 | | i.ie-14 

Description 

pirr [LNrB69884] [ACrB69884 ] [PNrhost factor- 1 protein homolog ymaH] [GNrymaH ] 
[CLrhost factor I] [ORrBacillus subtilis] [DBrpir2] >gpr [GI r ell83393 rg2634118] 
[LN r BSUB0010] [AC r Z99113 r AL009126] [GNrymaH] [FNrunknown] [ORrBacillus subtilis] 
[DB r genpept-bctl] [DErBacillus subtilis complete genome (section 10 of 21) r from 
1781201to 2014980.] [NTrsimilar to host factor-1 protein] [LEr85414] [REr85635] 
[DIrdirect] 
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NT AA 

ORF Name NTID AAID Le ^ t frLength SCQre probability 



AI7503000983J>933390_cl_759 | [ 515 | [ 4388 |[552~] | 183 |pnT"]| l. 9e-37 



Description 

sp: [LN:GLPP_BACSU] [AC:P30300] [GN : GLPP] [OR : BACILLUS SUBTILIS] [DE : GLYCEROL UPTAKE 
OPERON ANT I TERMINATOR REGULATORY PROTEIN] [SP:P30300] [DB : swissprot] >pir : [LN:B47700] 
[AC:B47700:D69634 ] [PN:glycerol metabolism regulatory protein GlpP] [GNrglpP ] 
[ORrBacillus subtilis] [DB:pir2] [MP:75 (degrees) ] >gp : [GI :gl42996] [LN: BACGLPPFK] 
[AC :M99611] [PN : regulatory protein] [GNiglpP] [FNrputative antiterminator] [OR:Bacillus 
subtilis] [SR: Bacillus subtilis DNA] [DB : genpept-bctl] [DE: Bacillus subtilis 
antiterminator regulatory protein (glpP) , glycerol uptake facilitator (glpF) genes, 
complete cds, glycerolkinase (glpK) gene, 5' end.] [LE:328] [RE:906] [DI: direct] 
>gp: [GI:ell82916:g2633250] [LN : BSUB0005] [AC : Z99108 : AL009126] [PN : transcription 
antiterminator] [GNrglpP] [FN: control of mRNA stability of glpD] [OR: Bacillus subtilis] 
[DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 5 of 21) : from 802821 
tol011250.] [SP:P30300] [LE:198429] [RE:199007] [DI:direct] >gp : [GI:ell82928:g2 633262] 
[LN:BSUB0006] [AC : Z99109 : AL009126] [PN : transcription antiterminator] [GN:glpP] 
[FN:control of mRNA stability of glpD] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[SP:P30300] [LE:1749] [RE:2327] [DI:direct] >gp : [GI : e324939 : g222613 5] [LN : BSY14079] 
[AC:Y14079] [PN : regulatory protein] [GN:glpP] [FN:putative antiterminator] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [DE: Bacillus subtilis chromosomal DNA, region 75 degrees: 
glpPFKDoperon and downstream.] [NT: see EMBL M99611 and Swiss Prot P30300.] [SP:P30300] 
[LE:1397] [RE:1975] [DI: direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|AI7503000983_7064077_cl 740 J |617 | [4389 ||21B7 1 1728 | pb08 | |1 . 3e-260 



Description 

sp: [LN:IF2_BACSU] [AC : P17889 : 03 1757] [GN: INFB] [OR:BACILLUS SUBTILIS] [DE : TRANSLATION 
INITIATION FACTOR IF-2] [SP : P1788 9 : 031757] [DB : swissprot] >pir : [LN : A35269] 
[AC:A35269:B35269:S31994:G69644 ] [PN: translation initiation factor IF-2] [GN:infB ] 
[CL: translation initiation factor IF-2 : translation elongation factor Tu homology] 
[0R:Bacillus subtilis] [DB:pirl] >gp : [GI : gl43359] [LN : BACPSIF2A] [AC:M34836] 
[0R:Bacillus subtilis] [SR :B . subtilis (strain RS410) DNA, clones lambda- JET [1, 2] ,pUK, 
an] [DB : genpept-bctl] [DE : B . subtilis protein synthesis initiation factor 2 (infB) 
gene, complete cds . ] [NT:protein synthesis initiation factor 2 (infB)] [LE:381] 
[RE:2531] [DI:direct] >gp : [GI : g49319] [LN : BS0RF1T7A] [AC:Z18631] [GN:IF2] [0R:Bacillus 
subtilis] [DB:genpept-bctl] [DE : B . subtilis inf B-nusA operon . ] [SP:P17889] [LE:2689] 
[RE : 483 9] [DI : direct] 



NT AA 

ORF Name NTID AAID — — j , score probabilit y 
LengthLength c -L 



AI76030009B3J7203i7fi_ci_810 | |6ifl | [ 4390 |[SOT~j |266 | | 426 | |S.4e-40 



Description 

sp: [LN:TRPA_METJA] [AC:Q60180] [GN : TRPA :MJ1038] [OR : METHANOCOCCUS JANNASCHII] 
[EC:4.2.1.20] [DE : TRYPTOPHAN SYNTHASE ALPHA CHAIN,] [SP:Q60180] [DB : swissprot] 
>pir: [LN:E64429] [AC:E64429 ] [PN : tryptophan synthase, alpha chain] [CL : tryptophan 
synthase alpha chain : tryptophan synthase alpha chain homology] [OR : Methanococcus 
jannaschii] [EC : 4 . 2 . 1 . 20] [DB:pir2] [MP : FOR969735- 970589 ] >gp : [GI :gl591691] 
[LN:U67546] [AC :U67546 : L77117] [PN : tryptophan synthase alpha subunit (trpA) ] 
[GN:MJ1038] [OR : Methanococcus jannaschii] [DB :genpept-bct2] [DE : Methanococcus 
jannaschii section 88 of 150 of the complete genome.] [NT: similar to GB:M65060 
SP:P26920 PID:149750] [LE:5837] [RE:6691] [DIrdirect] 
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ORF Name 



NT AA 
NTID AAID — _ — score 
Length Length 



probability 



IAI7S03000983 724221.0 c2 924 



] | 515 | I 4391 || 581 IP 25 IP 06 |p^"8e-27 



Description 

gp: [GI:g5002553] [LN: AF074603] [AC : AF074603 ] [PNrNonF] [GNmonF] [OR : Streptomyces 
griseus subsp. griseus] [DB : genpept-bct2] [DE : Streptomyces griseus subsp. griseus 
nonactin biosynthesis genecluster, partial sequence.] [LE: 12384] [RE: 13088] [DI:direct] 



ORF Name 



IAI7503000983 801552 t2 228 



NT AA 
NTID AAID — — , . score 
LengthLength 

J po | [4392 |pn~||i25 | [73 ||0.0ii 



probability 



Description 

gp: [GI:g2454643] [LN: AF020905] [AC : AF020905] [PN:E5] [GN:E5] [OR : common chimpanzee 
papillomavirus 1] [DB : genpept - vrl ] [DE:Common chimpanzee papillomavirus 1, complete 
genome.] [LE:3901] [RE:4185] [DI:direct] 



ORF Name 



LAI7503000983 835252 ti 13 



Description 
NO-HIT 



NTID 



NT AA 
AAID — » — . score 
LengthLength 



probability 



] | I 4393 | 



1ST" 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000983 837550 c2 914 



Description 
NO-HIT 



J [622 | [4394 ||246 1 181 | 



ORF Name 



NT 



AA 



NTID AAID — _ — , , score probabi lity 
LengthLength ^~ 



IAI7503000983 859052 c2 876 



J [523 | [4395 |pT7T] [558 | |T7uTr||5 . 3e-175 



Description 

Sp : [LN : GLPD_BACSU] [AC:P18158] [GN : GLPD] [OR:BACILLUS SUBTILIS] [EC : 1 . 1 . 99 . 5] 
[DE : AEROBIC GLYCEROL- 3 -PHOSPHATE DEHYDROGENASE, ] [SP:P18158] [DB : swissprot] 
>pir: [LN:C45868] [AC : C45868 : A69634 : S18565 ] [PN: glycerol -3 -phosphate dehydrogenase, 
glpD] [GN:glpD ] [OR:Bacillus subtilis] [EC : 1 . 1 . 99 . 5] [DB:pir2] >gp : [GI :gl42993] 
[LN : BACGLPKD] [AC:M34393] [OR:Bacillus subtilis] [SR : B . subtilis (strain W168) DNA] 
[DB:genpept-bctl] [DE :B. subtilis glycerol kinase (glpK) and 

glycerol - 3 -phosphatedehydrogenase (glpD) genes , complete cds . ] [NT : glycerol - 3 -phosphate 
dehydrogenase (glpD) (EC] [LE:2329] [RE:3996] [DI: direct] >gp : [GI : ell82919 :g2633253] 
[LN:BSUB0005] [AC : Z99108 : AL009126] [PN: glycerol -3 -phosphate dehydrogenase] [GN: glpD] 
[FN:glycerol utilization] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 1 . 1 . 99 . 5] 
[DE:Bacillus subtilis complete genome (section 5 of 21): from 802821 tol011250.] 
[SP:P18158] [LE:201660] [RE:203327] [DI: direct] >gp : [GI : el 182931 :g2 633265] 
[LN:BSUB0006] [AC : Z99109 : AL009126] [PN : glycerol - 3 -phosphate dehydrogenase] [GN: glpD] 
[FN:glycerol utilization] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 1 . 1 . 99 . 5] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[SP:P18158] [LE:4980] [RE:6647] [DIrdirect] >gp : [GI : e324 942 : g2226138] [LN: BSY14079] 
[AC:Y14079] [PN : glycerol - 3 -phosphate dehydrogenase] [GN:glpD] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE: Bacillus subtilis chromosomal DNA, region 75 degrees: 
glpPFKDoperon and downstream.] [NT: see EMBL M34393 and Swiss Prot P18158.] [SP:P18158] 
[LE:4628] [RE:6295] [DIrdirect] 
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ORF Name 



NTID 



NT AA 
AAID — , _ — A , score 
LengthLength 



probability 



[AI 7 503000983 891700 c2 965 



] |524 | [4296 ||867 | |288 ||599 ||2.5e- SB- 



Description 

pir: [LN:S22397] [AC:S22397 ] [PNipyruvate synthase, beta chain] [CLrpyruvate synthase 
beta chain] [OR : Halobacterium halobium] [EC:1.2.7.1] [DB:pir2] >gp : [GI : g43499] 
[LN : HHFEROXI ] [AC:X64521] [PN : f erredoxin oxidoreductase] [OR : Halobacterium halobium] 
[DB:genpept-bctl] [EC:1.2.7.1] [DE :H . halobium gene for pyruvate : f erredoxin 
oxidoreductase.] [NT : beta- subuni t ; pyruvate synthase] [LE:2057] [RE: 2995] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — — _ score 
LengthLength 



probability 



IAI7503000983 852186 tl 171 



] I 625 I I 4397 |p^~||42 | 



Description 
NO-HIT 

ORF Name 



NTID 



WA „ NT AA 
^ LengthLength 



F 



7503000983 893826 t2 384 



[4398 |p.23 | 



probability 
uTOTT 



Description 

pir: [LN:D64600] [AC:D64600 ] [PN : conserved hypothetical integral membrane protein 
HP0644] [OR: Helicobacter pylori] [DB:pir2] >gp : [GI :g2313764] [LN : AE000578] 
[AC:AE000578 :AE000511] [PN: conserved hypothetical integral membrane] [GN:HP0644] 
[OR Helicobacter pylori 26695] [DB :genpept-bct2] [DE : Helicobacter pylori 26695 section 
56 of 134 of the complete genome.] [NT:similar to SP:P25254 percent identity: 30.26;] 
[LE:6414] [RE:6707] [DI:direct] 



ORF Name 



NTID 



7V7VTTA NT AA 

AAID — — score 

LengthLength 



probability 



IAI7503000983 9767263 c2 866 



] P 7 | |4399 |p^~[|212 ||280 ||1.6e-24 



Description 

pir: [LN:A69922] [AC:A69922 ] [PN : phage- related replication protein homolog yoqZ] 
[GN:yoqZ ] [CL : phage- related replication protein] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI :ell85518:g2 634439] [LN : BSUB0011] [AC : 299114 :AL009126] [GN:yoqZ] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
11 of 21): from 2000171to 2207900.] [NT:similar to phage-related protein] [LE:189032] 
[RE:189856] [DI : complement] >gp : [GI : g3025599] [LN : AF020713] [AC : AF020713] [GN:yoqZ] 
[OR bacteriophage SPBc2] [DB : genpept -phg] [DE : Bacteriophage SPBc2 complete genome.] 
[NT:similar to bacteriophage SPP1 ORF37.1 product] [LE:95664] [RE:96488] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — _ — , score 
LengthLength 



LAI7503000983 978450 ci 804 



] |628 | [4400 |pMT]p2 Ip^F 



probability 
4.1e-97 



Description 

gp: [GI:dl025380:g2804528] [LN:D85082] [AC:D85082] [PN:YfiX] [OR:Bacillus subtilis] 
[SR: Bacillus subtilis DNA] [DB : genpept -bet 1] [DE: Bacillus subtilis DNA, genome 
sequence, 79 to 81 degree region.] [LE:7094] [RE:8926] [DI:direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



[AI7503000983 9798180 cl 792 



][ 



|4401 



Description 
NO-HIT 
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NTID 



NT AA 
AAID — -i _ — ^ score 
LengthLength 



probability 



|630 | [ 4402 [p7— [|208 | |7P1~|| 1 . 9e-69 



ORF Name 

IA1750300098J_9a04202_clJ/32 

Description 

pir: [LN:B69727] [AC:B69727 ] [PN : translation elongation factor EF-Ts tsf] [GN:tsf ] 
[CL: translation elongation factor EF-Ts] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:ell85241:g2634022] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN : elongation factor Ts] 
[GN:tsf] [ORrBacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome 
(section 9 of 21): from 1598421to 1807200.] [SP:P80700] [LE:119747] [RE:120628] 
[DI: direct] 



ORF Name 



NT AA 
NTID AAID — — , . score 
LengthLength 



probability 



IAI7503000983 9807807 ci 748 



j pi | [4403 ||FH~||290 ||522 ||3.6e-S0 



Description 

gp: [GI:gl842437] [LN : BSU87792] [AC:U87792] [PN:unknown] [ORrBacillus subtilis] 
[DBr genpept-bctl] [DE: Bacillus subtilis tRNA-Ala , phosphatidylglycerophosphate 

synthase (pgsA) and CinA (cinA) genes, complete cds, and RecA (recA) gene, partial cds . ] 
[NT: hypothetical 17.9 kDa protein; ORF158] [LE:3676] [RE:4152] [DIrdirect] 



ORF Name 



NTID 



^ m NT AA 
AAID — . — . . score 
LengthLength 



IA17503000983 9814213 El 499 



probability 
| | 1.9e-19 



Description 

pir: [LN:C69419] [AC:C69419 ] [ PN : phosphate ABC transporter, periplasmic 

phosphate -binding protein (phoX) homolog] [CL:sphX protein] [OR : Archaeoglobus fulgidus] 
[DB:pir2] >gp : [GI :g2649219] [LN : AE001010] [AC : AE001010 : AE000782] [ PN : phosphate ABC 

transporter, periplasmic] [GN:AF1356] [OR: Archaeoglobus fulgidus] [DB :genpept-bct2] 
[DE: Archaeoglobus fulgidus section 97 of 172 of the complete genome.] [NT:similar to 

PID:1052826 percent identity: 25.09;] [LE:4300] [RE:5283] [DIrdirect] 



ORF Name 



17503000983 SB626Vb t2 310 



Description 
NO-HIT 



NTID 



_ __ NT AA 
AAID — — , score 
LengthLength 



probability 



[4405 ||135 [ 



ORF Name 



U7S03000983 995967 c2 888 



NT 



AA 



NTID AAID — , 



LengthLength 



score 



] P 4 I | 4405 |[7¥7~||248 |p¥T 



probability 
5.4e-3i 



Description 

pir: [LN:B70039] [AC:B70039 ] [PN : hypothetical protein yvfS] [GN:yvfS ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell86096 :g2 635921] [LN : BSUB0018] [AC : Z 99121 : AL009126] 
[GN:yvfS] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 18 of 21): from 3399551to 3609060.] [LE:97127] [RE:97864] 
[DI complement] >gp: [GI : e313010 : gl945719] [LN: BSZ94043] [AC:Z94043] [PN : hypothetical 
protein] [GN:yvfS] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis genomic DNA 
fragment (88 kb) . ] [NT:probable permease] [LE:83807] [RE:84544] [DI:direct] 



ORF Name 



„ mTr> _ _ _ ._ NT AA 

NTID AAID — _ — _ score 

LengthLength 



probability 



p75Q3000984_10017151_r3_388 ' 

Description 

NO-HIT 



] I 635 | | 4407 1P^~1| 43 | 
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ORF Name 



IA175030009U4 1U039050 ti 17 



Description 
NO-HIT 

ORF Name 



IA17503000984 10193760 ti 89 



Description 
NO-HIT 



NT AA 
NT ID AAID — 1 _, T — , . score 
Length Length 



~J |S36 | [4408 ||165 | |b4 | 



NT ID 



[4409 



1155 I 



IT?"" 



probability 



„ T „ NT AA 

AAID — , — . . score 

LengthLength 



probability 



ORF Name 



NT ID 



AAID 



NT 



AA 



T — T — . score probabi lity 
LengthLength c 



IAI7503000984 10553125 r3 371 



[638 | [4410 [|81Q— 1|269 ||335 |[1.6e-31 



Description 

gp: [GI:g211700] [LN:CHKCX] [AC:M13496] [PN:type X collagen] [ORrGallus gallus] 
[SR: Chicken red blood cell DNA, clone pYN92El; and embryo chondrocyte] [DB : genpept - vrt ] 
[DE:Chicken type X collagen gene.] [LE:<380] [RE:2208] [Dlidirect] 



ORF Name 



NT AA 

NTID AAID — — , . score probabilit y 
LengthLength c -L 



IAI7503000984 10665903 ci 491 



4411 1(173" 



151 ||6.0e-10 



Description 

sp: [LN:YCGC_ECOLI] [AC : P37349 : P76013] [GN: YCGC] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 

51.6 KD PROTEIN IN TREA-PTH INTERGENIC REGION] [SP : P3734 9 : P76013] [DB : swissprot] 

>pir : [LN:C64866] [AC:C64866 ] [PN : trehalase precursor] [GN:ycgC ] 

[CL: phosphotransferase system phosphohistidine- containing protein homology] 

[OR: Escherichia coli] [DB:pir2] >gp : [GI : dl037041 :g4062781] [LN:D90754] 

[AC:D90754 :AB001340] [PN : Hypothetical protein in treA 5 ' region .] [GN:ycgC] 

[OR: Escherichia coli] [SR : Escherichia coli (strain : K12) DNA, clone :Kohara clone #245] 

[DB: genpept -bet 1] [DE Escherichia coli genomic DNA. (26.8 - 27.1 min) . ] 

[NT:ORF_ID:o245#7; similar to SwissProt Accession] [LE:9793] [RE: 11214] [DI : complement] 

>gp: [GI :gl787448] [LN : AE000218] [AC : AE000218 : U00096] [PN:putative PTS system enzyme I] 

[GN:ycgC] [FN: putative transport; Not classified] [OR : Escherichia coli] 

[DB : genpept -bet 2] [DE : Escherichia coli K-12 MG1655 section 108 of 400 of the 

completegenome . ] [NT:f473; 100 pet identical to fragment YCGC_ECOLI] [LE:3099] 

[RE : 4520] [DI : complement] 



ORF Name 



NTID 



NT 

AAID 



AA 



LengthLength 



IAI7503000984 10828312 ti 104 



[4412 |piT]|538 |p7? 



score probability 



Description 

pir: [LN:D69796] [AC:D69796 ] [PN : two -component sensor histidine kinase homolog yesM] 
[GN:yesM ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82674 :g2633008] [LN: BSUB0004] 
[AC:Z99107:AL009126] [GN :yesM] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 4 of 21): from 600701 to813890.] 
[NT: similar to two-component sensor histidine kinase] [LE: 157527] [RE: 159260] 
[DI : direct] 
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ORF Name 



NT AA 
NTID AAID — « _ — _ score 
Length Length 



probability 



IAI7503000984 1182765 c3 673 



] [641 | [4413 |[7?TF]|2404 |p09||0.0 



Description 

gp: [GI :g2982196] [LN : AF007865] [AC : AF00786 5] [PN :bacitracin synthetase 3] [GN:bacC] 
[OR: Bacillus lichenif ormis] [DB : genpept-bct2] [DE: Bacillus lichenif ormis bacitracin 

synthetase operon, completesequence ; BacS (bacS) , BcrA (bcrA) , BcrB (bcrB) , and BcrC 
(bcrC)genes, complete cds . ] [NT:peptide synthetase; BA3 / BacC] [LE:25258] [RE:44337] 
[DI : direct] 



ORF Name 



NT AA 
NTID AAID — — ^ score 
LengthLength 



probability 



IAI7503000984 11852518 c2 558 



J |£42 | |4414 ||T33 I j£ 



Description 
NO-HIT 

ORF Name 



NTID AAID 



NT AA 
Le^thLe^th ^g probability 



IAI7503000984 11895058 cl 444 



|4415 |pJ9~ ||312 ||164 j|5 . le-10 



Description 

pir: [LN:S25140] [AC:S25140 ] [PN:serine proteinase homolog] [OR : Enterococcus faecalis] 
[DB:pir2] >gp: [GI:g43338] [LN: EFSPREG] [AC.-Z12296] [PN : Staphylococcal serine proteinase 
homologue] [GN : sprE] [OR : Enterococcus faecalis] [DB :genpept-bctl] [DE : E . faecalis sprE 
gene for serine proteinase homologue.] [LE:91] [RE: 945] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



^17503000984 12554627 c3 625 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



LengthLength 



AA 

— , score probability 



1AI7503000984 1367200 t2 254 



] P 5 I I 4417 |p^~|[207 ||TuT—||0. 00095 



Description 

sp: [LN:VS10_ROTBS] [AC:P34718] [GN:S10] [OR:BOVINE ROTAVIRUS] [SR:GROUP C / SHINTOKU, ] 
[DE : MINOR OUTER CAPSID PROTEIN (NS26) ] . [SP:P34718] [DB : swissprot] 



ORF Name 



_ ___ __ NT AA 

NTID AAID — , — , score 
LengthLength 



probability 



1A17503000984 1367202 cl 489 



j |645 | [4418 |p3~ ||320 [ p5~ ||i . ie-60 



Description 

pir: [LN:E64866] [AC:E64866 ] [PN : hypothetical protein bl200] [OR : Escherichia coli] 
[DB:pir2] >gp: [GI :dl 037043 :g4 062783] [LN:D90754] [AC : D90754 : AB001340] [PN : Hypothetical 

protein] [OR : Escherichia coli] [SR : Escherichia coli (strain : K12 ) DNA, clone :Kohara clone 

#245] [DB:genpept-bctl] [DE : Escherichia coli genomic DNA. (26.8 - 27.1 min) . ] 

[NT:ORF_ID:o246#2; similar to PIR Accession Number] [LE: 11865] [RE: 12965] 

[DI: complement] >gp: [GI :gl787450] [LN: AE000218] [AC : AE000218 :U00096] [PN:putative 

dihydroxyacetone kinase (EC 2.7.1.2)] [GN:bl200] [FN:putative enzyme; Not classified] 
[OR Escherichia coli] [DB :genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 108 of 

400 of the completegenome. ] [NT:f366; 35 pet identical (32 gaps) to 355 residues] 
[LE : 5171] [RE : 6271] [DI : complement] 
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ORF Name 



NT AA 
NTID AAID — ■> _ — ^, score 
LengthLength 



probability 



IAI7503000984 1367J40 cl 424 



|647 [ [4419 |pT~|p90 |p5TT-|p.9e-31 



Description 

gp: [GI :el87587 :gl420862] [LN: SPOPPDACA] [AC:X89237] [PN : oligopeptidepermease] [GN:oppD] 
[OR: Streptococcus pyogenes] [DB :genpept-bctl] [DE : S . pyogenes DNA for oppA, oppB, oppC, 
oppD, oppF, and dacA genes.] [LE:5854] [RE:6924] [DI:direct] 



ORF Name 



NTID 



1AI7503000984 13707008 t3 321 



Description 
NO-HIT 



NT AA 
AAID — — , , score 
Length Length 



probability 



[4420 |pn | [75" 



ORF Name 



LAI7503000984 1385962 c3 666 



NTID 



ST9~ 



AAID 



NT 



AA 



LengthLength 



score 



|442i | prr 



TuTS" 



probability 
l.6e-102 



Description 

sp: [LN:BIOB_BACSU] [AC:P53557] [GNrBIOB] [OR:BACILLUS SUBTILIS] [EC:2.8.1.6] [DE : BIOTIN 
SYNTHASE, (BIOTIN SYNTHETASE) ] [SP:P53557] [DB : swissprot ] >pir : [LN : D69594] [AC:D69594 ] 
[PN:biotin synthetase bioB] [GN : bioB ] [CL:biotin synthetase] [OR: Bacillus subtilis] 
[DB:pir2] >gp : [GI : gl27702 9] [LN : BSU51868] [AC:U51868] [PN:biotin synthase] [GN:bioB] 
[FN:biotin pathway] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
biotin biosynthetic operon genes, complete andpartial cds . ] [LE:6088] [RE: 7095] 
[DI:direct] >gp: [GI :ell85893 :g2635504] [LN : BSUB0016 ] [AC : Z99119 : AL009126] [PN:biotin 
synthetase] [GN :bioB] [FN:biotin biosynthesis] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[EC:2.8.1.-] [DE:Bacillus subtilis complete genome (section 16 of 21): from 2997771to 
3213410.] [SP:P53557] [LE:91793] [RE:92800] [DI : complement] >gp : [GI : g2293187] 
[LN:AF008220] [AC : AF008220] [PN:biotin synthase] [GN : bioB] [OR:Bacillus subtilis] 
[DB:genpept-bct2] [DE:Bacillus subtilis rrnB-dnaB genomic region.] [LE:87627] 
[RE : 88634] [DI : direct] 



ORF Name 



NT 

NTID AAID — 



AA 



LengthLength 



score 



IAI7503000984 14492327 cl 486 



[4422 |p^~[|142 |pr" 



probability 
2.6e-47 



Description 

sp: [LN:FOSB_STAEP] [AC:Q03377] [GN : FOSB] [OR : STAPHYLOCOCCUS EPIDERMIDIS] [DE : FOSFOMYCIN 
RESISTANCE PROTEIN] [SP:Q03377] [DB : swissprot] >pir : [LN : B48175] [AC:B48175 ] 
[PN: f osf omycin resistance protein B] [CL : f osf omycin resistance protein] 
[OR: Staphylococcus epidermidis] [DB:pir2] >gp: [GI:g46982] [LN : SEFOSB] [AC:X54227] 
[GN : f osB] [OR staphylococcus epidermidis] [DB :genpept-bctl] [DE : S . epidermidis plasmid 
pIP1842 fosB gene for FOSB.] [SP:Q03377] [LE:714] [RE:1133] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — _^ — score 
Length Length 



probability 



A*7503000984_14510962_cl_446 

Description 

NO-HIT 



] EE 1 1 4423 1P^~|| 45 I 
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NT AA 

ORFName NTID AAID Le ~ thLe — th ^ore probability 



AI75030009fl4jL4H9b212_cJJ>J9 | [ 552 | [ 4424 | p2~ || 273 |pgQ— le-9ff 



Description 

gp: [GI:g3800821] [LN : AF076683 ] [AC : AF076683 ] [PN : oligopeptide transporter putative 
ATPase domain] [GN:opp-lD] [OR : Staphylococcus aureus] [DB : genpept - be t 2 ] 
[DE: Staphylococcus aureus oligopeptide transporter putative substratebinding domain 
(opp-lA) , oligopeptide transporter putative membranepermease domain (opp-lB) , 
oligopeptide transporter putativemembrane permease domain (opp-lC) , oligopeptide 
transporterputative ATPase domain (opp-lD) , and oligopeptide transporterputative ATPase 
domain (opp-lF) genes, complete cds; and unknowngene . ] [LE:3335] [RE:4150] [DIrdirect] 

NT AA 

ORF Name NTID AAID — . — , , score probability 
LengthLength c *- 



|AI75030005a4_i57625_ci_42S | [ 553 | [4425 | p7T"] p03 p 



.00054 



Description 

gp: [GI:el407888:g4493994] [LN : PFMAL3P7] [AC : AL034559] [GN:MAL3P7 . 44] [OR : Plasmodium 
falciparum] [SRimalaria parasite P. falciparum] [DB : genpept -invl] [DE : Plasmodium 
falciparum MAL3P7, complete sequence.] [NT : predicted using hexExon; MAL3P7.44 
(PFC1065w),] [LE:188885] [RE:191470] [DIrdirect] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



p75030009B4_162550_ca_6SO | | 6b4 | [4426 | p^— || 49 || 0.03b 

Description 

Sp : [LN : YPMB_BACSU] [AC:P54396] [GN: YPMB] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 17.9 
KD PROTEIN IN DING-ASPB INTERGENIC REGION] [SP:P54396] [DB : swissprot] >pir : [LN : F6993 8] 
[AC:F69938 ] [PN : hypothetical protein ypmB] [GN : ypmB ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:gll46245] [LNiBACYPIA] [AC:L47709] [GNrypmB] [FN : hypothetical] [ORiBacillus 
subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis (clone YAC15-6B) ypiABF genes, qcrABC 
genes , ypjABCDEFGHI genes, birA gene, panBCD genes, dinG gene, ypmB gene,aspB gene, asnS 
gene, dnaD gene, nth gene and ypoC gene, completecds ' s . ] [NT : putative] [LE : 18616] 
[RE:19101] [DIrdirect] >gp: [GI : ell83683 : g2634656] [LN: BSUB0012] [AC : Z99115 : AL009126] 
[GNrypmB] [FN:unknown] [ORrBacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis 
complete genome (section 12 of 21) : from 2195541to 2409220.] [SP:P54396] [LE:152566] 
[RE : 153051] [DI : complement] 

NT AA 

ORF Name NTID AAID t — ^ T — , , score probabili ty 
LengthLength c L - 



Al7S030009B4jL52578_cl_42S | |655 | [ 4427 |p^r~j| 327 || 338 ||T7Te^U 



Description 

pir: [LN:A69867] [AC:A69867 ] [PN : conserved hypothetical protein ykuT] [GNrykuT ] 
[ORrBacillus subtilis] [DB:pir2] >gp: [GI : ell81925 :g2632241] [LN: BS16829KB] 
[AC:AJ222587] [PN : YkuT protein] [GNrykuT] [ORrBacillus subtilis] [DB : genpept -bet 1] 
[DEr Bacillus subtilis 29kB DNA fragment from ykwC gene to csel5 gene.] [LE: 25580] 
[RE:26383] [DI : complement] >gp: [GI : ell85011 rg2633792] [LN:BSUB0008] 

[AC:Z99111:AL009126] [GN:ykuT] [FNrunknovm] [ORrBacillus subtilis] [DB r genpept -bctl] 
[DErBacillus subtilis complete genome (section 8 of 21): from 1394791to 1603020.] 
[NT: similar to hypothetical proteins] [LE: 95836] [RE: 96639] [DI : complement] 



NT AA 

ORF Name NTID AAID — , — , score probability 



AI7603000&B4_164B607SJ:2_i36 1 

Description 
NO-HIT 



LengthLength 
[4428 JTO 
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ORF Name 



NTID 



NT 

AAID _ — t 



AA 



LengthLength 



score probability 



1AI7503000984 16b902 c3 574 



] | g57 | | 4429 H 627 |p 08 ||1.8e-i4 



] 



Description 

sp: [LN:LP14_BACSU] [AC:P39144] [GN:LPA-14] [OR:BACILLUS SUBTILIS] [DE : LIPOPEPTIDE 
ANTIBIOTICS ITURIN A AND SURFACTIN BIOSYNTHESIS PROTEIN] [SP:P39144] [DB : swissprot] 
>pir: [LN: 139875] [AC:I39875 ] [PN : siderophore biosynthesis regulatory protein 
sfp : lipopeptide antibiotics iturin Arsurfactin production protein] 
[GN:lpa-14 :sfb:sfp:sfp (0) ] [CL : siderophore biosynthesis regulatory protein sfp] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl005421 :g473916] [LN: BACLPA14] [AC:D21876] 
[PN: lipopeptide antibiotics iturin A] [GN:lpa-14] [OR:Bacillus subtilis] [SR:Bacillus 
subtilis (strain:RB14) DNA] [DB :genpept-bctl] [DE : B . subtilis lpa-14 gene encoding 
lipopeptide antibiotics iturin A.] [LE:1] [RE: 675] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID — — , 
LengthLength 



score probability 



lA175030uu984 I8756I cl 470 



] I 558 | |4430 ||879 



Description 
NO-HIT 



ORF Name 



NTID 



7V7vtt ^ NT AA 

AAID — — , . score 

LengthLength 



probability 



V17b0a000984 194010 c3 607 



] |559 | [4431 ||IgI2"||S03 | [1381 ||3 \ 4e-141 



Description 

gp: [GI:dl039113 :g4514332] [LN : AB013369] [AC : AB01336 9] [OR:Bacillus halodurans] 
[SR:Bacillus halodurans (strain:C-125) DNA] [DB : genpept -bctl] [DE:Bacillus halodurans 
C-125 yesT and comEC genes, partial andcomplete cds . ] [NT:unknown] [LE:4328] [RE:5830] 
[DI :direct] 



ORF Name 



NTID 



AAID . — 



IAI7503000984 19688401 c2 51S 



14432 



NT AA 
LengthLength 



score probability 



Description 
NO-HIT 

ORF Name 



NTID 



_ _ -r-y^ NT AA 

AAID — , — , score 

LengthLength 



probability 



U7B03000984 19773387 t3 353 



][ 



[4433 |pgB"||41B ||745 ||5.6e-74 



Description 

sp : [LN : STPA_STAAU] [AC:P81297] [OR : STAPHYLOCOCCUS AUREUS] [EC:3.4.22. 
[DE:STAPHOPAIN, ] [SP:P81297] [DB : swissprot] 



ORF Name 



NT AA 
NTID AAID — _ „. — , score 
LengthLength 



probability 



^17503000984 20585963 £3 396 



] | [4434 ||gTT~| |25fl | p7~ ||i . 8e-64 



Description 

gp: [GI:g929972] [LN:BAU30714] [AC:U30714] [OR:Bacillus anthracis] [SR:plasmid pXOl] 

[DB:genpept-bctl] [DE:Bacillus anthracis Weybridge A toxin plasmid pXOl right 
invertedrepeat element (WeyAR) bordering the toxin- encoding region, ORFAand ORFB genes, 
complete cds.] [NT : ORFB ; similar to B. anthracis SterneL element ORFB;] [LE:512] 

[RE:1336] [DI:direct] >gp : [GI : g929975] [LN : BAU30715] [AC:U30715] [OR:Bacillus 
anthracis] [SR:plasmid pXOl] [DB : genpept -bet 1 ] [DE:Bacillus anthracis Sterne toxin 
plasmid pXOl left inverted repeatelement (SterneL) bordering the toxin- encoding region, 
ORFB andtruncated ORFA genes, complete cds.] [NT: ORFB; similar to B. anthracis WeyAR 
element ORFB;] [LE:458] [RE:1282] [DI: direct] >gp : [GI : g4894312] [LN: AF065404] 

[AC:AF065404] [PN:pX01-96] [OR:Bacillus anthracis] [DB : genpept -bct2] [DE:Bacillus 
anthracis virulence plasmid PXOl, complete sequence.] [LE: 116307] [RE: 117131] 

[DI:direct] 
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ORF Name 



[AI7503000984 20704012 rl 105 



Description 
NO-HIT 



NT ID 



NT AA 
AAID — _ — , , score 
LengthLength 



probability 



[553 | [4435 [79 | 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000984 20979688 c2 511 



][ 



|443£ ||786 ||26i ~|p^5~ 



probability 
5.9e-34 



Description 

sp: [LN:NIKC_ECOLI] [AC:P33592] [GN:NIKC] [OR : ESCHERICHIA COL I] [DE : NICKEL TRANSPORT 
SYSTEM PERMEASE PROTEIN NIKC] [SP:P33592] [DB : swissprot] >pir: [LN:S39596] 
[AC:S39596:S47697:A65145 ] [PN:nikC protein] [GNmikC ] [CL : oligopeptide permease 
protein oppB] [OR : Escherichia coli] [DB:pir2] >gp : [GI : g581141] [LN : ECNIK] [AC:X73143] 
[PN:NikC] [GNmikC] [OR : Escherichia coli] [DB : genpep t - bet 1 ] [DE:E.coli DNA sequence of 
nik locus.] [SP:P33592] [LE:2942] [RE:3775] [DIrdirect] >gp : [GI :g912461] [LN:ECOUW76] 
[AC:U00039] [GNmikC] [OR : Escherichia coli] [SR : Escherichia coli (substrain MG1655, 
strain K-12) (library: lambda] [DB : genpept - be t 1 ] [DE : E . coli chromosomal region from 
76.0 to 81.5 minutes.] [LE:30444] [RE:31277] [DI:direct] >gp : [GI : gl789889] 
[LN:AE000423] [AC : AE000423 :U00096] [PN : transport of nickel, membrane protein] [GNmikC] 
[FN: transport; Transport of small molecules:] [OR : Escherichia coli] [DB : genpept -bct2] 
[DE: Escherichia coli K-12 MG1655 section 313 of 400 of the completegenome . ] [NT:o277] 
[LE:7496] [RE:8329] [DIrdirect] 



ORF Name 



NT ID 



NT 



AA 



AAID — r — , 
Length Length 



score probability 



IAI7503000984 212827 t3 295 



] |665 | [4437 |p^T||493 | p5"~ ||2 . 6e-40 



Description 

pir: [LN:G70006] [AC:G70006 ] [PN : mult idrug resistance protein homolog yubD] [GN : yubD ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85986 :g2635597] [LN : BSUB0016] 
[AC:Z99119 :AL009126] [GN:yubD] [FNmnknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 16 of 21): from 2997771to 3213410.] 
[NT: similar to multidrug resistance protein] [LE: 193143] [RE: 194678] [DI : complement] 



ORF Name 



NT 



AA 



NT ID AAID Le ~ thLe ~ th score probability 



1A1 7 503000984 21588287 t2 271 



Description 
NO-HIT 



] | [4438 |[T?5 | pT 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



AI7B03000984_2i307016J:ijL23 

Description 

NO-HIT 



j [657 | [4439 |p2~~ 1 |43 | 
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ORF Name 



NT AA 
NT ID AAID — . _ — , . scor e 
LengthLength 



probability 



IAI7503000984 22042337 c2 588 



] |658 | [4440 ||T¥5T1|477 ||981 ||8.3e-99 



Description 

pir: [LN:F69763] [AC:F69763 ] [PN : multidrug resistance protein homolog ycnB] [GN:ycnB ] 
[CL:lincomycin-resistance protein lmrB] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell82351:g2632685] [LN : BSUB0003] [AC : Z99106 : AL009126] [GN: ycnB] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 

3 of 21): from 402751 to611850.] [NTrsimilar to multidrug resistance protein] 
[LE:32866] [RE:34284] [DI : complement] >gp : [GI : dl009651 : gl805454] [LN:D50453] 
[AC:D50453] [PN : homologue of multidrug resistance protein B,] [GN: ycnB] [OR:Bacillus 

subtilis] [SR:Bacillus subtilis (strain:168 trpC2) DNA] [DB : genpept-bctl] [DE:Bacillus 

subtilis DNA for 25-36 degree region containing theamyE-srfA region, complete cds . ] 
[LE: 115269] [RE: 116687] [DI : complement] 



ORF Name 



F 



17503000984 22323413 C2 518 



NTID 



„ TO NT AA 

AAID — , _ — . . score 

LengthLength 



probability 



14441 



144 IPT7 



Description 
NO-HIT 



ORF Name 



NTID 



IAI7503000984 22464127 r2 191 



][ 



IS7TT 



NT AA 

AAID , , 

LengthLength 

|4442 ||153 1 150 I 



score probability 



Description 
NO-HIT 

ORF Name 



IAI7503000984 22537818 c2 515 



NTID 



F7T 



AAID 



Description 
NO-HIT 



NT AA 
LengthLength 
[4443 |p5u~| |49 | 



— 4-Vi SCQre probability 



ORF Name 



NTID AAID , 



IAI7503000984 22554140 13 3S5 



Z7T 



NT AA 
LengthLength 



score probability 
191 ||1.0e-ii 



Description 

pir: [LN:F69280] [AC:F69280 ] [PN:iron (II) transporter (feoB-1) homolog] [CL:ferrous 
iron transport protein B : translation elongation factor Tu homology] [OR : Archaeoglobus 
fulgidus] [DB:pir2] >gp : [GI :g2650395] [LN: AE001089] [AC : AE001089 : AE000782] [PN:iron 
(II) transporter (feoB-1)] [GN:AF0246] [OR : Archaeoglobus fulgidus] [DB : genpept-bct2] 
[DE: Archaeoglobus fulgidus section 18 of 172 of the complete genome.] [NT: similar to 
GB:L77117 SP:Q57986 PID:1591272 percent] [LE:10039] [RE:11958] [DI : complement] 



ORF Name 



NT 



AA 



NTID AAID , — 



LengthLength 



score 



[AI7503000984 22664S50 c2 512 



"j |4445 |JSF1 [ |220 IP 7 ^" 



probability 
b.4e-24 



Description 

pir: [LN:C65145] [AC : C65145 : S3 9598 : S47699 ] [PN:nikE protein] [GN:nikE ] [CL : unas signed 
ATP-binding cassette proteins: ATP-binding cassette homology] [OR: Escherichia coli] 
[DB:pir2] >gp : [GI :g466616] [LN:ECOUW76] [AC:U00039] [GN:nikE] [OR: Escherichia coli] 
[SR: Escherichia coli (substrain MG1655, strain K-12) (library: lambda] 

[DB: genpept-bctl] [DE : E . coli chromosomal region from 76.0 to 81.5 minutes.] [LE: 32038] 
[RE:32844] [DI: direct] >gp : [GI : gl789891] [LN : AE000423] [AC : AE000423 : U00096] 
[PN: ATP-binding protein of nickel transport system] [GN:nikE] [FN: transport ; Transport 
of small molecules:] [OR Escherichia coli] [DB : genpept -bct2 ] [DE : Escherichia coli K-12 
MG1655 section 313 of 400 of the completegenome . ] [NT:o268] [LE:9090] [RE: 9896] 
[DI: direct] 



223 



NT AA 

ORFJJame NTID AAID ^ thLe — ^ score probability 



AI7B030009fl4_amJ4:*2_ci_448 | |674 | [4446 ||384 | |127 ||152 ||2,4e-10 



Description 

sp: [LN:INVO_PIG] [AC:P18175] [GN:IVL] [OR : SUS SCROFA] [SR:,PIG] [DE : INVOLUCRIN] 
[SP:P18175] [DB:Swissprot] >pir : [LN : 146592] [AC:I46592 ] [PN : involucrin] 
[CL:involucrin] [OR:Sus scrofa domestica] [SR:, domestic pig] [DB:pir2] 

>gp: [GI:gl64523] [LN: PIGINVOLA] [AC:M34441] [OR : Sus scrofa] [SR:Pig (Yorkshire) adult 
skin keratinocyte DNA] [DB : genpept-mam] [DE:Pig involucrin gene, complete cds . ] 
[NT : involucrin] [LE:1] [RE: 1044] [DI : direct] 



NT AA 

ORF Name NTID AAID — _„ — _ score probability 
Length Length c ^ 



|ftI7S03000384_2344442B_ci_426 | [675 | [4447 | 28i || 362 || 3.2e-33 

Description 

pir: [LN:B69834] [AC:B69834 ] [PN : conserved hypothetical protein yhjK] [GN:yhjK ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83056 :g2633390] [LN :BSUB0006] 
[AC:Z99109:AL009126] [GN:yhjK] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to hypothetical proteins] [LE: 127427] [RE: 128287] [DI : complement] 
>gp: [GI:e324984:g2226183] [LN : BSY14 081] [AC:Y14081] [PN : hypothetical protein] [GN:yhjK] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis chromosomal DNA, region 
92 degrees: regionbetween comK and addAB.] [NT : Similarity to a large family of 
hypothetical] [LE:9722] [RE: 10582] [DI : complement] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

|AI7B03000984_23470250_c2_B64 | [575 | [4448 |pT5~| | 304 || 120B ||ITB e-122 | 

Description 

gp: [GI :g3800820] [LN: AF076683] [AC : AF076683] [PN : oligopeptide transporter putative 
membrane] [GN:opp-lC] [OR : Staphylococcus aureus] [DB : genpept - be t 2 ] [DE : Staphylococcus 
aureus oligopeptide transporter putative substratebinding domain (opp-lA) , oligopeptide 
transporter putative membranepermease domain (opp-lB) , oligopeptide transporter 
putativemembrane permease domain (opp-lC) , oligopeptide transporterputative ATPase 
domain (opp-lD) , and oligopeptide transporterputative ATPase domain (opp-lF) genes, 
complete cds; and unknowngene . ] [LE:2469] [RE:3338] [DI:direct] 

NT AA 

ORF Name NTID AAID — — , _ sco re probability 
LengthLength • L - 

|A17b03000984_23S837_cl_47i | |677 | [ 4449 |p?T] p^" 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength u 

AI7S03000984_2360iS10_ci_4a6 | |678 | [ 4450 |pTT7"| | 378 | pTI~] p . 2e-9i "| 



Description 

sp : [ LN : GLDA_BAC ST] [AC:P32816] [GN : GLDA : GLD] [OR:BACILLUS S TE AROTHERMO PH I LUS ] 

[EC: 1.1. 1.6] [DE : GLYCEROL DEHYDROGENASE, (GLDH) ] [SP:P32816] [DB : swissprot] 

>pir: [LN: JQ1474] [AC : JQ1474 : S38514 ] [PN:glycerol dehydrogenase,] [GN:gldA ] 

[CL: glycerol dehydrogenase : lac taldehyde reductase homology] [OR: Bacillus 

stearothermophilus] [EC:1.1.1.6] [DB:pir2] >gp: [GI:gl42978] [LN : BACGLDA] [AC:M65289] 

[PN: glycerol dehydrogenase] [GN:gld] [OR: Bacillus stearothermophilus] [SR: Bacillus 

stearothermophilus (sub_species nondiastaticus) (library] [DB : genpept-bctl] 

[EC: 1.1. 1.6] [DE: Bacillus stearothermophilus glycerol dehydrogenase (proposed gld)gene, 

complete cds.] [LE:742] [RE: 1854] [DI: direct] 



224 



_ NT AA 

QRF Name NTID AAID thLe — fch score probability 



A17S030009B4_2J6bm8_cl_4S9 | |6 7 9 | | 44bl | |i66 1 155 | |83 | | 0.0012 



Description 

sp: [LN:GGI3_STAHA] [AC:P11699] [OR : STAPHYLOCOCCUS HAEMOL YT I CUS ] [DE : ANTIBACTERIAL 
PROTEIN 3 (GONOCOCCAL GROWTH INHIBITOR 3)] [SP:P11699] [DB : swissprot] >pir : [LN : BXSA3 ] 
[AC:S00601 ] [PN : antibacterial protein 3:gonococcal growth inhibitor 3] 

[CL: Staphylococcus haemolyticus antibacterial protein] [OR : Staphylococcus haemolyticus] 
[DBrpirl] 

NT AA 

QRF Name NT ID AAID Le ~ thLe ~ th score probability 



AI7S03000984_23860307_c3_64i | |580 | [ 4452 | pT3~|| 405 | | i603 | | i ■ 0e-i64 



Description 

gp: [GI :g3800823] [LN: AF076683] [AC : AF076683 ] [PN:unknown] [OR : Staphylococcus aureus] 
[DB : genpept -bct2 ] [DE : Staphylococcus aureus oligopeptide transporter putative 
substratebinding domain (opp-lA) , oligopeptide transporter putative membranepermease 
domain (opp-lB) , oligopeptide transporter putativemembrane permease domain (opp-lC) , 
oligopeptide transporterputative ATPase domain (opp-lD) , and oligopeptide 
transporterputative ATPase domain (opp-lF) genes, complete cds; and unknowngene . ] 
[NTrorfX] [LE:4904] [RE: 6097] [DIrdirect] 

NT AA 

QRF Name NT ID AAID — — , , sco re probability 

LengthLength c JL 

|A17b03000984 2 4000275_r3_303 | [681 | [4453 [ jT^TJ | 341 | p.033 ||2 . 5e-104 | 



Description 

sp: [LN : OTCC_HAEIN] [AC:P44770] [GN : ARCB : HI 05 96] [OR : HAEMOPHILUS INFLUENZAE] 
[EC: 2. 1.3. 3] [DE : ORNITHINE CARBAMOYLTRANSFERASE , CATABOLIC, (OTCASE) ] [SP:P44770] 
[DB: swissprot] >pir : [LN: H64079] [AC:H64079 ] [PN : ornithine carbamoyl transf erase, ] 
[CL : ornithine carbamoyl transf erase : aspartate/ornithine carbamoyl transferase homology] 
[OR Haemophilus influenzae] [EC:2.1.3.3] [DB:pir2] >gp: [GI :gl573585] [LN:U32741] 
[AC:U32741 :L42023] [PN : ornithine carbamoyl transf erase (arcB) ] [GN:HI0596] 
[OR : Haemophilus influenzae Rd] [DB :genpept-bct2] [DE : Haemophilus influenzae Rd section 
56 of 163 of the complete genome.] [NT:similar to GB:X05637 SP:P08308 PID:45288 
percent] [LE:3467] [RE: 4471] [DI : complement] 

NT AA 

QRF Name NT ID AAID — — , . score probability 
LengthLength c L - 



AI7S03000984_2405B367_ci_438 | |682 | [4454 |[T?13~| |42 [ 



Description 
NO-HIT 



ORF Name 



IAI7503000984 24105393 c2 5S2 



1 



NTID 



TVTVTT^ NT ^ 

AAID — — _ score 
LengthLength 



probability 



14455 



1758 I 



7TT 



3.4e-70 



Description 

gp: [GI:el456529:g4914622] [LN : LMAJ962 7] [AC : AJ00962 7] [PN : pyruvate- formate lyase 
activating enzyme] [GN:pflC] [OR:Listeria monocytogenes] [DB : genpept-bctl] [DE:Listeria 
monocytogenes pflC, orfA, lltB and orfC genes.] [NT : putative ; similar to Streptococcus 
mutans PflC] [LE:149] [RE: 895] [DIrdirect] 
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ORF Name 



NTID 



IAI7503000984 2422b37b c2 533 



]f 



NT AA 

AAID — « _ 

LengthLength 

^ [4455 ||213 |p 



score probability 
T7 ||3.9e-05 



Description 

gp: [GI:g3212079] [LN: AF068633] [AC : AF068633 ] [PNiphenol soluble modulin beta 1] 
[FN: inflammatory protein] [OR : Staphylococcus epidermidis] [DB : genpept-bct2] 
[DE : Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds . ] [NT:PSM beta 1] [LE:669] [RE: 803] [DI : direct] 

>gp: [GI :g3212080] [LN : AF068633] [AC : AF068633] [PN:phenol soluble modulin beta 2] 

[FN: inflammatory protein] [OR: Staphylococcus epidermidis] [DB : genpept-bct2] 

[DE : Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds.] [NT:PSM beta 2] [LE:859] [RE: 993] [DI:direct] 



ORF Name 



NTID 



NT AA 
AAID — _ — , . score 
LengthLength 



probability 



1A17503000984 24257881 tl 68 



Description 
NO-HIT 



"] |685 | |4457 ||138 | |45 [ 



ORF Name 



117503000984 24266041 tl 22 



NTID 



Description 
NO-HIT 



a7vtt ^ NT AA 

AAID — _ — _ score 

LengthLength 

[4458 j|129 1 142 | 



probability 



ORF Name 



IAI7503000984 24351678 ii 120 



NTID 



„ T „ NT AA 

AAID — _ score 

Length Length 



probability 



[4459 |p^~| 



Description 
NO-HIT 

ORF Name 



NT AA 
NTID AAID — _ T — , , score 
LengthLength 



probability 



IAI7503000984 24407677 t3 293 



J |688 [ [4460 1 177 | 



Description 
NO-HIT 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



IAI7503000984 24650300 cl 476 



] | [4461 ||1365"[|454 |p3T] f 



probability 
9.6e-105 



Description 

sp: [LN : BRNQ_LACDL] [AC:P54104] [GN:BRNQ] [OR : LACTOBACILLUS DELBRUECKI I ] 
[SR: ,SUBSPLACTIS] [DE : CHAIN AMINO ACID UPTAKE CARRIER)] [SP:P54104] [DB : swissprot] 
>pir: [LN:S60180] [AC:S60180 ] [PN : branched- chain amino acid carrier brnQ] [GNrbrnQ ] 
[CL: branched- chain amino acid transport system II carrier protein braZ] 
[OR Lactobacillus delbrueckii] [DB:pir2] >gp : [GI :g732813] [LN : LDBRNQGN] [AC:Z48676] 
[PN: branched- chain amino acid carrier] [GN:brnQ] [FN: transport of branched- chain amino 
acids (Leu,] [OR : Lactobacillus delbrueckii] [DB : genpept-bctl] [DE : L . delbrueckii brnQ 
gene for branched- chain amino acid carrier.] [SP:P54104] [LE:611] [RE:1951] [DI:direct] 
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ORF Name 



NT AA 
NT ID AAID — — x , score 
LengthLength 



probability 



IA17503000984 24664012 t2 199 



] |690 | [4452 ||4B3 | |lb0 | |S2 [[0.0052 



Description 

gp: [GI:dl025730:g2879910] [LN:D85752] [AC:D85752] [GN : bacD] [OR : Enterococcus faecalis] 
[SR:Enterococcus faecalis plasmid:pPDl DNA] [DB :genpept-bctl] [DE : Enterococcus faecalis 
plasmid pPDl bacA, bacB, bacC, bacD, bacE,bacF, bacG, bacH and bad genes, complete 
cds.] [LE:3977] [RE:4324] [DI: direct] 



ORF Name 



NTID 



„ Tn NT AA 

AAID — « — , , score 

LengthLength 



probability 



117503000984 24711588 c2 589 



] [591 | [4453 ||225~| [7¥ 



Description 
NO-HIT 



ORF Name 



NT 

NTID AAID _ — 



AA 



LengthLength 



score 



^17503000984 24884688 c2 509 



592 | [4454 ||S2B~||17B ||216 | [ 



probability 
5.6e-17 



] 



Description 

gp: [GI:g3800818] [LN: AF076683] [AC : AF076683 ] [PN: oligopeptide transporter putative 
substrate] [GN:opp-lA] [OR : Staphylococcus aureus] [DB : genpept -bct2 ] [DE : Staphylococcus 
aureus oligopeptide transporter putative substratebinding domain (opp-lA) , oligopeptide 
transporter putative membranepermease domain (opp-lB) , oligopeptide transporter 
putativemembrane permease domain (opp-lC) , oligopeptide transporterputative ATPase 
domain (opp-lD) , and oligopeptide transporterputative ATPase domain (opp-lF) genes, 
complete cds; and unknowngene . ] [LE:64] [RE: 1524] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ T — . , score 
LengthLength 



IAI7503000984 24886552 c3 635 



[4465 |[752— 1|263 |P^~| [ 



probability 
9.4e-59 



] 



Description 

gp: [GI:dl037145:g4062842] [LN : AB009078] [AC : AB009078] [PN : L-2 . 3 -butanediol 
dehydrogenase] [OR : Brevibacterium saccharolyticum] [SR : Brevibacterium saccharolyticum 
DNA] [DB:genpept-bctl] [DE : Brevibacterium saccharolyticum gene for 
L-2 . 3-butanedioldehydrogenase, complete cds.] [LE:1743] [RE:2519] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID — . T — . 
LengthLength 



score 



IAI7503000984 25429665 ci 496 



][ 



|445g |p^T5~| 



probability 
|4.3e-21 



] 



Description 

pir: [LN:S58131] [AC:S58131 ] [PN:integral membrane protein LmrP] [OR : Lactococcus 
lactis] [DB:pir2] >gp : [GI :gl052754] [LN:LLLMRP] [AC:X89779] [PN: LmrP integral membrane 
protein] [GN:lmrP] [OR: Lactococcus lactis] [DB : genpept-bctl] [DE:L. lactis DNA for LmrP 
gene.] [LE:634] [RE:1860] [DI:direct] 



ORF Name 



NTID AAID 



LAI7503000984 25476378 tl 13 



][ 



NT AA 
LengthLength 
] [4467 [57 



score probability 



Description 
NO-HIT 



ORF Name 



AI7503000984_25894687_c3_624 

Description 

NO-HIT 



NT 



AA 



NTID AAID — _ — , , score probability 
L en gthLength 



[4468 pT 
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ORF Name 



NT AA 

^ AAI5 Le ^thLe^th ^^ probability 



7503000984 2618^681 c3 696 



Description 
NO-HIT 



ORF Name 



^17603000984 26208460 i:i 24 



Description 
NO-HIT 



|697 | [4469 |pg— 1|175 | 



NT AA 

NTID AAID Le ^ thLe ~ th score probability 



698 | [4470 



ST" 



ORF Name 



IA17503000984 26229678 tl 27 



Description 
NO-HIT 



NT AA 

NTID AAID — — , , score prob ability 

LengthLength 



] | 699 1 1 4471 IF 77 "! | 58 | 



ORF Name 



_ _ NT AA 

NTID AAID — ~ — _ score 

LengthLength 



probability 



IAI7603000984 26266641 c2 643 



Description 
NO-HIT 



] |700 | [4472 |[TTr^|48 | 



ORF Name 



NTID 



NT 



AA 



AAID . — , 



LengthLength 



score 



[AI 7 603000984 26367135 cl 608 



701 



[4473 |692 [p^S" 



probability 
2.2e-153 



Description 

sp: [LN:LIP_STAEP] [AC:Q02510] [GN : GEHC] [OR : STAPHYLOCOCCUS EPIDERMIDIS] [EC:3.1.1.3] 
[DE : LIPASE PRECURSOR, (GLYCEROL ESTER HYDROLASE)] [SP:Q02510] [DB : swissprot ] 
>pir: [LN:A47705] [AC:A47705 ] [PN : triacylglycerol lipase,] [CL : Staphylococcus 
triacylglycerol lipase] [OR : Staphylococcus epidermidis] [EC: 3. 1.1.3] [DB:pir2] 
>gp: [GI:gl53022] [LN: STAGEHC] [AC:M95577] [PNrlipase] [GNrgehC] [OR : Staphylococcus 
epidermidis] [SR : Staphylococcus epidermidis (strain 9) DNA] [DB : genpept-bctl] 
[DE: Staphylococcus epidermidis lipase (gehC) gene, complete cds . ] [NT : GTG start codon] 
[LE:121] [RE:2187] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



AI7503000984 26369027 £2 260 



][ 



7u"2~ 



[4474 | jlTT 



probability 
11360 ||6.6e-138 



Description 

pir: [LN:E69806] [AC:E69806 ] [PN : conserved hypothetical protein yf jN] [GN:yf jN ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82793 :g2633127] [LN : BSUB0005] 
[AC:Z99108 :AL009126] [GN:yfjN] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 5 of 21): from 802821 tol011250.] 
[NT:similar to hypothetical proteins] [LE:73112] [RE:74089] [DI:direct] 

>gp: [GI :dl025211:g2780398] [LN:D78509] [AC:D78509] [PN: Yf jN] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain :AC327) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis 

YfjG-YfjR genes, complete cds.] [LE: 12236] [RE: 13213] [DI : complement] 



228 



ORF Name 



NT 



AA 



NTID AAID . — ^ , — 



LengthLength 



score 



E 



17503000984 26600936 Tl 253 



] |703 | [4475 ||513 | [170 ||T7T 



probability 
2.2e-34 



Description 

sp: [LN:Y318_HAEIN] [AC:P43984] [GN:HI0318] [OR : HAEMOPHILUS INFLUENZAE] [DE : HYPOTHETICAL 
PROTEIN HI0318] [SP:P43984] [DB : swissprot] >pir : [LN: B64006] [AC:B64006 ] 
[PN: hypothetical protein HI0318] [OR : Haemophilus influenzae] [DB:pir2] 
>gp: [GI:gl573288] [LN:U32717] [AC : U32 717 : L42023] [PN : conserved hypothetical protein] 
[GN:HI0318] [OR : Haemophilus influenzae Rd] [DB : genpept -bct2 ] [DE : Haemophilus influenzae 
Rd section 32 of 163 of the complete genome.] [NT:similar to SP:P54158 PID:1256620 
GB:AL009126] [LE:4064] [RE:4582] [DIrdirect] 



ORF Name 



NTID 



IAI7503000984 26759430 c3 €54 



][ 



NT AA 

AAID 

LengthLength 

|4476 [[573 [ p2T 



score probability 
WT2 |1 2.3e-87 



Description 

pir: [LN:E69670] [AC:E69670 ] [PNrglycine betaine/carnitine/choline ABC transporter 
(osmoprotec) opuCC] [GN:opuCC ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI :ell86069:g2635894] [LN:BSUB0018] [AC : Z99121 : AL009126] [PN:glycine 
betaine/carnitine/choline ABC] [GN:opuCC] [FN: high affinity transport of glycine 
betaine,] [OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis complete 
genome (section 18 of 21): from 3399551to 3609060.] [NT : alternate gene name: yvbC] 
[LE:67766] [RE:68677] [DI : complement] 



ORF Name 



NT 

NTID AAID . — 



AA 



LengthLength 



score probability 



IAI7503000984 26760076 t3 305 



][ 



7TT5"" 



[4477 |p37F]|525 | [1550 ||4 . 2e-i53 



Description 

sp : [LN: YFCC_HAEIN] [AC:P44023] [GN:HI0594] [OR : HAEMOPHILUS INFLUENZAE] [DE : HYPOTHETICAL 
PROTEIN HI0594] [SP:P44023] [DB : swissprot] >pir : [LN:E64010] [AC:E64010 ] 
[PN: hypothetical protein HI0594] [CL Haemophilus influenzae conserved hypothetical 
protein HI0594] [OR : Haemophilus influenzae] [DB:pir2] >gp : [GI :gl573583] [LN:U32741] 
[AC-.U32741 :L42023] [PN : conserved hypothetical transmembrane protein] [GN:HI0594] 
[OR : Haemophilus influenzae Rd] [DB : genpept -bct2] [DE : Haemophilus influenzae Rd section 
56 of 163 of the complete genome.] [NT: similar to GB:AE000783 percent identity: 35.02;] 
[LE:891] [RE: 2420] [DI : complement] 



ORF Name 



NTID 



AAID 



Ll7b03000984 26774137 c2 574 



Tuir 



NT AA 
LengthLength 
[4478 | [ITS | |2T 



score probability 



Description 
NO- HIT 



_ NT AA 
ORF Name NTID AAID — _ „ — , , score probability 
LengthLength c u 



^17503000984 272593 c3 636 1 (7U7 



|4479 [|gT^~||292 ~||62 ||0.0052 



Description 

sp: [LN:YORB_LISMO] [AC:P33382] [OR: LISTERIA MONOCYTOGENES] [DE : HYPOTHETICAL 12.0 KD 
PROTEIN IN PLCB-LDH INTERGENIC REGION (ORFB) ] [SP:P33382] [DB : swissprot] 
>pir: [LN: 143868] [AC:I43868 ] [PN : ORFB] [CL hypothetical protein MJ14 13] [OR:Listeria 
monocytogenes] [DB:pir2] >gp : [GI : gl49648] [LN : LISACTLDH] [AC:M82881] [OR:Listeria 
monocytogenes] [SR:Listeria monocytogenes (strain L028) DNA] [DB : genpept -bet 1] 
[DE: Listeria monocytogenes lecithinase, lactate dehydrogenase (actA)gene complete cds, 
(plcB) gene complete cds, (ldh) gene completecds . ] [NT : ORFB] [LE:4094] [RE:4426] 
[DI : complement] 



229 



ORF Name 



NT AA 
NTID AAID , — ^. _ — ^, score 
LengthLength 



probability 



1X 7 503000984 2906307 tl 60 



J |708 | [4480 ||T?5~]|41 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



^17503000984 2928437 ci 477 



7TTT 



|448i ||g5T~||3Qg "j p7F~ 



probability 
1.4e-34 



Description 

pir: [LN:A69401] [AC:A69401 ] [PN : conserved hypothetical protein AF1210] 
[ORrArchaeoglobus fulgidus] [DB:pir2] >gp : [GI :g2649377] [LN : AE001021] 
[AC:AE001021:AE000782] [PN : conserved hypothetical protein] [GN:AF1210] 

[ORrArchaeoglobus fulgidus] [DB : genpept-bct2] [DE : Archaeoglobus fulgidus section 86 of 
172 of the complete genome J [NT: similar to GP: 1654020 percent identity: 34.56;] 
[LE : 12088] [RE : 13029] [DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score probability 



IA17503000984 2929718 cl 463 



|710 | [4482 |p^]|652 |p34B||0.0 



Description 

gp: [GI:g3789932] [LN : AF090142] [AC : AF090142] [PN:lipase precursor] [GN:gehD] 
[OR staphylococcus epidermidis] [DB : genpept-bct2] [EC:3.1.1.3] [DE : Staphylococcus 
epidermidis lipase precursor (gehD) gene, completecds . ] [NT : GehD] [LE:293] [RE:2224] 
[Dlidirect] 



ORF Name 



NTID 



NT AA 
AAID — — , score 
Length Length 



probability 



IA17503000984 29398437 12 132 



7TT 



77" 



Description 
NO-HIT 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000984 3003137 12 215 



] 



ITT 



j [4484 ||30i ||294 ||5.2e-2g 



Description 

pir : [LN:H70313] [AC:H70313 ] [PN: cobalamin synthesis related protein CobW] [GN:cobW ] 
[OR:Aquifex aeolicus] [DB:pir2] >gp : [GI :g2982874] [LN: AE000675] [AC : AE000675 : AE000657] 
[PN: cobalamin synthesis related protein CobW] [GN:cobW] [OR:Aquifex aeolicus] 
[DB:genpept-bct2] [DE:Aquifex aeolicus section 7 of 109 of the complete genome.] 
[LE: 11287] [RE: 12165] [DIrdirect] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



Al7^03000984_30103592_c3_600 

Description 

NO-HIT 



][ 



TO" 



[4485 |P :? ^~~| |43 | 



230 



NT AA 

ORF Name NTID AAID Le ~ thLe ^ t fr scor e probability 



AI 7 503000984_3u2blSblJ:3_4i2 | |7 14 | [ 4486 | |159 | |52 | p | |0 , 027 



Description 

sp: [LN:Y021_BPHP1] [AC:P51723] [OR : BACTERIOPHAGE HP1] [DE : HYPOTHETICAL 19.2 KD PROTEIN 
IN REP-HOL INTERGENIC REGION (ORF21) ] [SP:P51723] [DB : swissprot] >pir : [LN : S6952 7] 
[AC:S69527 ] [PN : hypothetical protein 21] [OR:phage HP1] [DB:pir2] >gp : [GI : gl046248] 
[LN:BHU24159] [AC : U24159 : U06847 : M28366 :M12911 :M22941 :M12910 :M15313] [OR : Bacteriophage 
HP1] [DB:genpept-phg] [DE : Bacteriophage HP1 strain HPlcl, complete genome.] [NT:orf21] 
[LE: 17028] [RE: 17528] [Dlidirect] 

" — — ■ ..nil..— ik— ■ . -„m,w- , — .,„—. . — ,^ ., „.. „.„„,,... ,,.i..„.... ,. nmm . hum..,,.,.,.,,, . lt ,.^ F ^ MF ii , n , .. ^ — ~- n MMt ^^^ M | l ^^^^^^^^ M ^ MMM , |||M|I , n||||||||Mmn|| -^ 

NT AA 

ORF Name NTID AAID — _„ — _ score probability 
LengthLength ^ dL 



p75030u0984_3u27i882_c3_6i8 | |7i5 | [4487 || 720 ~[ |235 | | 268 | |3.0e-23 

Description 

gp: [GI:g490316] [LN:A02585] [AC:A02585] [GN:bioD] [OR : synthetic construct] 
[DB:genpept-pat] [DE : Synthetic (LORF1) bioD gene.] [LE:39] [RE: 764] [DI: direct] 

NT AA 

ORF Name NTID AAID — ^ r — j , score probability 
LengthLength 



&I7S03000984_30272531J:2_228 | [715 | [4488 |p^~| [47 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ — , , score probability 
Length Length e - 



AIVS03000984_3035i677_c3_!>98 | [717 [ [4489 |p^~| [ 25 7 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ T — , , score probability 
LengthLength c J - 

|AE7B03000S84_30742307_c2_S72 | [718 | [4490 |222 || 640 || i.le-62 1 

Description 

pir: [LN:F69670] [AC:F69670 ] [PN:glycine betaine/carnitine/choline ABC transporter 
(membrane p) opuCD] [GN:opuCD ] [CL: glycine betaine/carnitine/choline ABC transporter] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : g2271392] [LN: AF009352] [AC : AF0093 52 ] 
[PN: transmembrane protein] [GN:opuCD] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE: Bacillus subtilis osmoprotectant transport system OpuC includingATPase (opuCA) , 
transmembrane protein (opuCB) , osmoprotectantbinding protein precursor (opuCC) and 
transmembrane protein (opuCD) genes , complete cds . ] [NT: OpuCD; part of the 
osmoprotectant transport system] [LE:3627] [RE: 4316] [DI: direct] 
>gp: [GI:ell86068:g2635893] [LN : BSUB0018 ] [AC : Z99121 : AL009126] [PN:glycine 
betaine/carnitine/choline ABC] [GN:opuCD] [FN: high affinity transport of glycine 
betaine,] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete 
genome (section 18 of 21): from 3399551to 3609060.] [NT : alternate gene name: yvbB] 
[LE : 67059] [RE : 67748] [DI : complement] 

NT AA 

ORF Name NTID AAID — — . , score probab ility 
LengthLength 

AI7503000984_3125587_tl_li4 | [719 | [4491 |pf2~| p — 



Description 
NO-HIT 



231 



ORF Name 



LAI7503000984 31287bl3 cl 464 



Description 
NO-HIT 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



"j |720 | [4492 |p^~| [49 | 



ORF Name 



NTID 



AAID 



F 



IV503000984 316^552 c3 597 



][ 



TIT 



NT AA 
Length Length 
|4493 | |320 



score 



probability 
1.4e-32 



Description 

gp: [GI:g3800819] [LN: AF076683] [AC : AF076683 ] [PN : oligopeptide transporter putative 
membrane] [GN:opp-lB] [OR : Staphylococcus aureus] [DB : genpept -bct2 ] [DE : Staphylococcus 
aureus oligopeptide transporter putative substratebinding domain (opp-lA) , oligopeptide 
transporter putative membranepermease domain (opp-lB) , oligopeptide transporter 
putativemembrane permease domain (opp-lC) , oligopeptide transporterputative ATPase 
domain (opp-lD) , and oligopeptide transporterputative ATPase domain (opp-lF) genes, 
complete cds; and unknowngene . ] [LE:1537] [RE:2472] [DI:direct] 



ORF Name 



|A17b03000984 3174197 £2 222 



Description 
NO-HIT 



NTID 



][ 



TIT 



NT AA 
AAID — , — score 
LengthLength 



probability 



[4494 |pTD | |TTT5~ 



ORF Name 



XTmTT> „ T „ NT AA 

NTID AAID — — , , score 

LengthLength 



probability 



AI7503000984 32609682 El 403 



Description 
NO-HIT 



] I | 4495 |P^~I1 42 1 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



UC7503000984 32615677 c2 544 



"j |724 | [4496 |pnT]p44 ||6.7e-49 



Description 

pir : [LN:A69756] [AC:A69756 ] [PNiadhesion protein homolog ycdH] [GNrycdH ] [CLradhesin 

B] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : dl023108 : g2415736] [LN : AB000617] 
[AC:AB000617] [PN : YcdH] [GN:ycdH] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: 168 trpC2) DNA] [DB : genpept -bet 1] [DE:Bacillus subtilis genomic DNA, 22 to 25 

degree region, completecds . ] [NT : homologue of adhesion protein precursor of] [LE: 21421] 
[RE:22380] [Dlrdirect] >gp : [GI :ell82237:g2 632571] [LN :BSUB0002] [AC : Z99105 : AL009126] 
[GNrycdH] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DErBacillus subtilis 

complete genome (section 2 of 21): from 194651 to415810.] [NT:similar to adhesion 

protein] [LE:113236] [RE:114195] [DI:direct] 



ORF Name 



|AI7b03000984 3275b t2 185 



Description 
NO-HIT 



NTID 



31 



TIT 



7V , TT , NT AA 

AAID — i — , , score 

LengthLength 



probability 



|4497 pr 



ORF Name 



AI7S03000984_3324iS62_t2_i47 

Description 

NO-HIT 



NT AA 
NTID AAID — _ „ — , - score 
LengthLength 



probability 



] | 726 | | 4498 ||144 | pT 



232 



ORF Name 



NT ID 



NT AA 
AAID — , _ — , . score 
LengthLength 



IAI75030009U4 JJT7T7T5 c3 634 



j |727 | [4499 ||1401||466 |pT~ 



probability 
l.Be-39 



Description 

gp: [GI:el429016:g4753872] [LN:SCH10] [AC : AL04 9754] [PN:putative transmembrane efflux 
protein] [GN : SCH10 . 26c] [OR : Streptomyces coelicolor] [DB :genpept-bctl] [DE : Streptomyces 
coelicolor cosmid H10.] [NT : SCH10 . 26c, probable transmembrane efflux protein,] 
[LE : 22729] [RE : 24183] [DI : complement] 



ORF Name 



IAI7503000984 34171927 ci 468 



Description 
NO-HIT 



ORF Name 



[AI7503000984 34277062 c3 602 



Description 
NO-HIT 



NT AA 
NTID AAID — — , , sco re 
LengthLength 

|728 | [ 4500 || 47 | 



probability 



NTID 



rrzr 



„ Tn NT AA 

AAID — _ — , , score 

LengthLength 

|4501 [[151 [ |B0 | 



probability 



ORF Name 



1AI7503000984 34408552 ri 21 



Description 
NO-HIT 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



ORF Name 



p7503000984_34429837_il_39 



NTID AAID . 



][ 



TJT 



NT AA 
LengthLength 



score probability 
769 ||2.4e-76 



Description 

Sp: [LN:ARCL_ECOLI] [AC:Q46807] [GNrYQEA] [OR : ESCHERICHIA COLI] [DE : CARBAMATE 
KINASE-LIKE PROTEIN 1] [SP:Q46807] [DB : swissprot] >pir : [LN: B65071] [AC:B65071 ] 
[PN: hypothetical protein b2874] [CL : carbamate kinase] [OR : Escherichia coli] [DB:pir2] 
>gp: [GI:g887824] [LN: ECU28375] [AC:U28375] [OR : Escherichia coli] [DB : genpept-bctl] 
[DE : Escherichia coli K-12 genome; approximately 64 to 65 minutes.] [NT : ORF_o310] 
[LE:21276] [RE:22208] [DI:direct] >gp : [GI : gl789238] [LN : AE0003 70] [AC : AE0003 70 :U00096] 
[PN: putative kinase] [GN:yqeA] [FN: putative enzyme; Not classified] [OR : Escherichia 
coli] [DB:genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 260 of 400 of the 
completegenome. ] [NT:o310; This 310 aa ORF is 45 pet identical (21 gaps)] [LE:11360] 
[RE: 12292] [DI :direct] 



ORF Name 



NTID 



^ n NT AA 
^ LengthLength 



probability 



IAI7503000984 34631527 tl 109 



] I 732 I | 4504 1P^~|1 113 - |pT7r"||l-7e-34 



Description 

gp: [GI:dl045996:g5360820] [LN:D86934] [AC:D86934] [PN : transposase] [OR : Staphylococcus 
aureus] [SR : Staphylococcus aureus (strain :N315) DNA, clone_lib : library of N31] 

[DB:genpept] [DE : Staphylococcus aureus genes, mec region, partial and complete cds . ] 

[NT : ORF N026; putative] [LE:19527] [RE:19751] [DI:direct] 



233 



NT AA 

0RF Name »™ ™ Le^thLemth ^^ probability 



{AI7S030009fl4J*4fc4213b_c3_g63 | | 733 | |4b05 |pTS~~| | S1 ||79 || 0.020 — 

Description 

gp: [GI:g488889] [LN:A12521] [AC:A12521] [PNrAcidic Basic Repeat Antigen Rhoptry (ABRA) ] 
[OR: Plasmodium falciparum] [SRrmalaria parasite P. falciparum] [DB :genpept-pat] 
[DE:Agl89 clone J [LE:1:61] [RE: 45: 963] [DI : direct Join] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



A:t7g030009Q4_34g57g77_c3_gi5 | [734 | |45Qg | |2?5T] [ 337 | [ 2435 || o »9e-253 



Description 

gp: [GI:g4185565] [LN: AF115379] [AC : AF1153 79] [PN:surface protein Pis] [GN:pls] 
[OR staphylococcus aureus] [DB : genpept -bct2 ] [DE : Staphylococcus aureus surface protein 
Pis (pis) gene, complete cds . ] [NT :methicillin resistant; contains a DS repeat area] 
[LE:1] [RE: 4914] [DI: direct] 



NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength c JL 



AI7B03000984_3S3i275fe_t2_l!>l | |735 | |4S0 7 | p^B~| [ 64 | 



Description 
NO-HIT 



ORF Name 



NTID 



E 



17503000984 35331905 c3 G12 



NT AA 

AAID — , , 

LengthLength 

1T5" 



score 



ST 



probability 
0.012 



Description 

gp: [GI:g5052610] [LN: AF145660] [AC : AF145660] [PN : BcDNA . GH10614 ] [GN : BcDNA. GH10614] 
[OR:Drosophila melanogaster] [SR: fruit fly] [DB : genpept- inv2] [DE : Drosophi la 

melanogaster clone GH10614 BcDNA. GH10614 (BcDNA. GH106 14 ) mRNA, complete cds.] [LE:14] 
[RE: 964] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



^17503000934 36042152 c3 gig 



][ 



7T7~ 



[4509 |p^F]|1081 ||23Qg ||1.6e-251 



Description 

gp: [GI:g4185565] [LN : AF115379] [AC : AF115379] [PN: surface protein Pis] [GN:pls] 
[OR : Staphylococcus aureus] [DB : genpept -bct2] [DE : Staphylococcus aureus surface protein 

Pis (pis) gene, complete cds.] [NT :methicillin resistant; contains a DS repeat area] 
[LE:1] [RE:4914] [DI:direct] 



234 



NT AA 

ORF Name NT ID AAID Le ~ thLe ^ th SCQ re probability 



A17S03000984Jifcljb7b2_tlJ>0 | |738 | [ 4510 | | 1128 | |375 |J 1 7 2 | | i.7e-10 | 



Description 

sp: [LN:YPDA_BACSU] [AC:P50736] [GN : YPDA] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 36.3 

KD PROTEIN IN RECQ-CMK INTERGENIC REGION] [SP:P50736] [DB : swissprot] >pir : [LN : A69934] 
[AC:A69934 ] [PN : thioredoxin reductase homolog ypdA] [GN : ypdA ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : ell83740 :g2634713] [LN:BSUB0012] [AC : Z99115 : AL009126] [GN : ypdA] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 12 of 21): from 2195541to 2409220.] [NT:similar to thioredoxin 

reductase] [SP:P50736] [LE:204681] [RE:205655] [DI : complement] 

>gp: [GI:ell85564:g2634730] [LN : BSUB0013 ] [AC : Z99116 : AL009126] [GNrypdA] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [NT:similar to thioredoxin reductase] [SP:P50736] 
[LE:4961] [RE:5935] [DI : complement] >gp : [GI :gll46207] [LN : BACSERA] [AC:L47648] 
[GN: ypdA] [OR:Bacillus subtilis] [DB : genpept-bct2] [DE:Bacillus subtilis 
phosphoglycerate dehydrogenase (serA) , ypaA, f erredoxin (fer) , ypbB, recS, ypbD, ypbE, 
ypbF, ypbG, ypbH, glutamate dehydrogenase (ypcA) , ypdA, ypdB, ypdC, spore cortexlytic 
enzyme (sleB) , ypeB, ypfA, ypfB, cytidine monophosphatekinase (cmk) , ypfD, ypgA, yphA, 
yphB, yphC, NAD+ dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF 
genes, complete cds . ] [NT : putative] [LE: 10742] [RE: 11716] [DI: direct] 

_ NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

}AI76030005fl4_36601703_ci_467 | |739 | | 45ii | p3~| | 312 | || 7 . Oe-38 J 

Description 

sp: [LN:APBE_TREPA] [AC:083774] [GN : APBE : TP0796] [OR : TREPONEMA PALLIDUM] [DE : THIAMINE 
BIOSYNTHESIS LIPOPROTEIN APBE PRECURSOR] [SP:083774] [DB : swissprot] >pir : [LN : C71281] 

[AC:C71281 ] [PN : conserved hypothetical protein TP0796] [GN:TP0796 ] [CL : hypothetical 
protein HI0172] [OR : Treponema pallidum subsp. pallidum] [SR:, syphilis spirochete] 

[DB:pir2] >gp: [GI : g3323101] [LN : AE001250] [AC : AE0012 50 : AE000520] [PN : conserved 
hypothetical protein] [GN:TP0796] [OR : Treponema pallidum] [DB :genpept-bct2] 

[DE : Treponema pallidum section 66 of 87 of the complete genome.] [NT: similar to 
GB:L42023 SP:P44550 PID:1003244] [LE:10082] [RE: 11170] [DI : complement ] 

NT AA 

ORF Name NTID AAID — , — , , score probability 
LengthLength ^ L - 

|M7503000984_3939215_t2_247 | |740 | [ 4512 | p^"[ | 32 7 | [T7T^]| 8 . 3e-3b "| 

Description 

gp: [GI:g2766193] [LN: SHU75349] [AC:U75349] [PN : periplasmic- iron-binding protein BhiC] 
[GN:bhi operon] [OR : Brachyspira hyodysenteriae] [DB :genpept-bct2] [DE : Serpulina 
hyodysenteriae bhi operon, complete sequence.] [LE:1674] [RE: 2693] [DI: direct] 



235 



ORF Name 



NTID 



NT 



AA 



AAID _ , 



LengthLength 



score 



[AI7503000984 394^408 c3 655 



"J |741 | [4513 ||777 ||25fl | pT5" 



probability 
3.ie-28 



Description 

sp: [LN:XYNC_CALSA] [AC:P23553] [GN : XYNC] [OR : CALDOCELLUM S ACCHAROL YT I CUM ] 
[SR : , CALDICELLULOSIRUPTOR SACCHAROLYTICUS] [EC : 3 . 1 . - . -] [DE : ACETYL ESTERASE, 
(ACETYLXYLOSIDASE) ] [SP:P23553] [DB : swissprot] >pir : [LN : B37202] [AC:B37202 ] 
[PN:acetylesterase, (XynC) ] [OR: Caldocellum saccharolyticum] [EC:3.1.1.6] [DB:pir2] 
>gp: [GI:gl44297] [LN : CDCXYNAB] [AC:M34459] [OR : Caldicellulosiruptor saccharolyticus] 
[SR:C. saccharolyticum DNA, clone pNZ1400] [DB : genpept -bctl] [DE : C. saccharolyticum 
xylanase A (XynA) , beta-xylosidase (XynB) andacetyl esterase (XynC) genes, complete 
cds.] [NT:acetyl esterase (XynC)] [LE:1257] [RE:2057] [DI:direct] >gp : [GI : g2645420] 
[LN:AF005383] [AC : AF005383] [PN : acetylxylosidase] [GN:XynC] [OR : Caldicellulosiruptor 
saccharolyticus] [DB :genpept-bct2] [DE : Caldicellulosiruptor saccharolyticus putative 
transport protein (XynG) , putative transport protein (XynH) , xylanase (XynF) , xylanase 
(XynE) , xylanase (XynD) , xylanase (XynA), acetylxylosidase (XynC) and xylanase (XynB) 
genes, complete cds.] [LE: 13673] [RE: 14473] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ T — , , score 
LengthLength 



probability 



IAI7503000984 4062925 ±2 24$ 



] | 742 | | 4514 IP^HP 0 II 103 |P-fle-07 



. ] 

Description 

gp: [GI:g2072447] [LN: LLU93364] [AC:U93364] [PN:EpsJ] [GN:epsJ] [OR : Lactococcus lactis 

subsp. cremoris] [DB : genpept -bct2 ] [DE : Lactococcus lactis cremoris plasmid pNZ4000 

insertion sequenceIS982 putative transposase gene and eps gene 

cluster (epsRXABCDEFGHIJKL) , complete cds.] [LE:10209] [RE:11399] [DI:direct] 

>gp: [GI:g2072447] [LN : AF036485] [AC : AF036485 : AF036486 : AF03648 7 : U93364 ] [PN:EpsJ] 

[GN:epsJ] [OR : Plasmid pNZ4000] [DB:genpept] [DE : Plasmid pNZ4000, complete sequence.] 

[LE : 16729] [RE : 17919] [DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID ^ v 

LengthLength 



score 



probability 



IAI7503000984 4072680 tl 75 



] | 743 1 | 4515 — | |2B7 |[?T1~[ 



8.2e-28 



Description 

pir: [LN:A70039] [AC:A70039 ] [PN:ABC transporter (ATP-binding protein) homolog yvfR] 
[GN:yvfR ] [CL: ATP-binding cassette homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell86097:g2635922] [LN:BSUB0018] [AC : Z99121 : AL009126] [GN:yvf R] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete genome (section 
18 of 21): from 3399551to 3609060.] [NT:similar to ABC transporter (ATP-binding 
protein)] [LE:97865] [RE:98770] [DI : complement] >gp : [GI : e313073 :gl945718] [LN:BSZ94043] 
[AC:Z94043] [PN : hypothetical protein] [GN:yvf R] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic DNA fragment (88 kb) . ] [NT:probable ABC 
transporter] [LE: 82901] [RE: 83806] [DI:direct] 



ORF Name 



NTID 



IAI7503000984 4079382 t3 313 



, T _ NT AA 

AAID 

LengthLength 

] | 744 | | 4516 |p 43 | 



score 



probability 



Description 
NO-HIT 



ORF Name 



AI7503000984_4094052_±3_330 

Description 

NO-HIT 



NTID 



7*3" 



tvtvt^ NT ^ 

AAID — _ — . score 

LengthLength 

] |4517 |P^~| |72 | 



probability 



236 



ORF Name 



NTID 



NT AA 
AAID — , _ — ^ score 
LengthLength 



probability 



IAI75030009B4 409HJHb tl 115 



] | 745 | | 451fl IP^Hl 42 I 



Description 
NO-HIT 



ORF Name 



^17503000384 41010(53 ti 113 



NTID 



NT AA 
LengthLength 
] [ 4515 || 396 "| [T?T 



score 



probability 
JTTTTT1 



Description 

pir : [LN:T02638] [AC:T02638 ] [PN:G2 protein homolog] [GN:g2 ] [ORrDictyostelium 
discoideum] [DB:pir2] >gp : [GI :g3068587] [LN : AF000580] [AC : AF000580] [PN:G2-like] 

[GN:g2] [ORrDictyostelium discoideum] [DB : genpep t - invl ] [DE : Dictyostelium discoideum 
plasmid Ddp5, complete genome.] [NT: similar to plasmid Ddpl g2/g3/d4 protein; possible] 

[LE: 11232] [RE:12167] [DI:direct] 



ORF Name 



^17503000984 4101640 c2 545 



NTID 



73TT 



AAID 



NT 



AA 



LengthLength 



score 



[4520 [1017 |[SF0~ 



probability 
.9e-6S 



Description 

sp: [LN:YQIG_BACSU] [AC:P54524] [GNzYQIG] [OR:BACILLUS SUBTILIS] [EC:1. -.-.-] 

[DE : PROBABLE NADH- DEPENDENT FLAVIN OXIDOREDUCTASE YQIGJ [SP:P54524] [DB : swissprot] 

>pir: [LN:C69961] [AC:C69961 ] [PN : NADH -dependent flavin oxidoreductase homolog yqiG] 

[GNiyqiG ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013261 :gl303926] [LN:BACJH642] 

[AC:D84432 :D82370] [PNrYqiG] [OR:Bacillus subtilis] [SR:Bacillus subtilis 

(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 

region containing skin element.] [LE: 202096] [RE: 203214] [DI : complement] 

>gp: [GI:ell85689:g2634855] [LN: BSUB0013] [AC : Z99116 : AL009126] [GN:yqiG] [FNrunknown] 

[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 

13 of 21): from 2395261to 2613730.] [NT:similar to NADH -dependent flavin 

oxidoreductase] [SP:P54524] [LE:120407] [RE:121525] [DI:direct] 



ORF Name 



NTID AAID 



NT 



AA 



Length Length 



score 



IAI7503000984 4705053 c2 



7TT 



[4521 ||I72Tr||406 |[^T" 



probability 
5.8e-43 



Description 

Sp: [LN : BIOF_HAEIN] [AC:P44422] [GN : BIOF : HI1553 ] [OR : HAEMOPHILUS INFLUENZAE] 
[EC:2.3.1.47] [DE : LIGASE) ] [SP:P44422] [DB : swissprot] >pir : [LN :D64129] [AC:D64129] 
[PN: 8-amino-7-oxononanoate synthase homolog] [CL : 5-aminolevulinate synthase] 
[OR: Haemophilus influenzae] [DB:pir2] >gp: [GI :gl574397] [LN:U32830] [AC :U3 2830: L4 2023] 
[PN: 8-amino-7-oxononanoate synthase (bioF) ] [GN:HI1553] [OR : Haemophilus influenzae Rd] 
[DB:genpept-bct2] [DE Haemophilus influenzae Rd section 145 of 163 of the 
completegenome . ] [NTisimilar to SP:P53556 PID:1277027 PID:2293185] [LE:7529] [RE:8671] 
[DI : complement] 



ORF Name 



NTID 



„ TO NT AA 

AAID — _ — , score 

LengthLength 



probability 



AI7503000984_4722131_t2_268 

Description 

NO-HIT 



4522 \\TZT 



237 



NT AA 

ORF Name NTID AAID Le ^ t hLength score probability 



AI75030009tt4_476bJ:2_24b | | 751 | [4523 |p21""||106 |p ||0.0025 



Description 

gp: [GI:g5306152] [LN : AF160864] [AC : AF160864] [PN:haem lyase] [GN:yejR] 
[OR: Mitochondrion Tetrahymena pyriformis] [SR : Tetrahymena pyriformis] [DB:genpept] 
[DE : Tetrahymena pyriformis mitochondrial DNA, complete genome.] [NT: ATA initiation 
codon; ATG codon 128 nt downstream] [LE: 18212] [RE: 19750] [DI : complement] 

NT AA 

ORF Name NTID AAID — ^.^ — , , score pro bability 
LengthLength c 



[AI7503000984 4773518 c3 SS4 ] |752 | [4524 |pr~| |234 | p3 | |i.6e-S4 



Description 

gp: [GI:e312302 :gl944618] [LN: SEGAPLPP] [AC:Y12602] [PN:acid phosphatase] [GN:lppC] 
[FN:cell membrane lipoprotein] [OR : Streptococcus equisimilis] [DB :genpept-bctl] 
[DE: Streptococcus equisimilis gapC and lppC genes.] [LE:1390] [RE:2247] [DIrdirect] 



NT AA 

ORF Name NTID AAID — _„ — , , score probabil ity 
LengthLength L 



A17S03000Sfl4_4782963_cl_460 | |7S3 | [ 4525 |p5T] | 454 | | il05 | | 6 ■ 0e-il2 



Description 

pir: [LN:B70316] [AC:B70316 ] [ PN : DAPA aminotransferase] [GN : bioA ] 
[CL:beta-alanine- -pyruvate transaminase] [OR:Aquifex aeolicus] [DB:pir2] 

>gp: [GI:g2982887] [LN : AE000676 ] [AC : AE000676 : AE000657] [PN:DAPA aminotransferase] 

[GN : bioA] [OR:Aquifex aeolicus] [DB :genpept-bct2] [DE:Aquifex aeolicus section 8 of 109 

of the complete genome.] [LE: 10739] [RE: 12100] [D I : complement] 

NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength c L - 



A17B03000984_4797i77_c2_B66 | |754 | [ 4525 |[5gB~j| i95 | pJ3~" || 1 . 7e-2 7 



Description 

gp: [GI :dl011096 :gl001205] [LN: SYCSLLE] [AC : D64003 : AB001339] [PN : hypothetical protein] 
[OR:Synechocystis sp.] [SR : Synechocystis sp. (strain : PCC6803 ) DNA] [DB :genpept-bctl] 
[DE:Synechocystis sp. PCC6803 complete genome, 22/27, 2755703-2868766.] 
[NT:ORF_ID:slr0895] [LE:7468] [RE: 8031] [DI:direct] 

NT AA 

ORF Name NTID AAID — _ — _ score probabil ity 
LengthLength c 



A17B030009a4_4738202_c3_623 | |755 | |4S27 ||815"] [271 ||103 ||0.023 



Description 

pir: [LN:B71605] [AC:B71605 ] [PN: hypothetical protein PFB0850c] [GN:PFB0850c ] 
[OR Plasmodium falciparum] [DB:pir2] >gp : [GI : g3845292] [LN : AE001420] 
[AC:AE001420 :AE001362] [PN : hypothetical protein] [GN: PFB0850c] [OR : Plasmodium 
falciparum] [SR:malaria parasite P. falciparum] [DB : genpept - inv2 ] [DE : Plasmodium 
falciparum chromosome 2, section 57 of 73 of thecomplete sequence.] [NT : predicted by 
GlimmerM] [LE: 11963] [RE: 14386] [DI : complement] 



NT AA 

ORF Name NTID AAID — — _ score proba bility 
LengthLength 



IAI7503000984 4800077 t3 3S0 1 



Description 
NO-HIT 



] | 756 | | 4528 



1155 I 



238 



ORF Name 



NT AA 
NT ID AAID — , — t , score 
LenqthLength 



probability 



IA17503000984 4859628 c3 603 



J |757 | [4529 ||195 | pr 



Description 
NO-HIT 



NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 



V17503000984 4867842 c3 £52 



751T 



[4530 [pS - 1|325 ||486 [|2.3e-46 



Description 

pir: [LN:E69400] [AC:E69400 ] [PN : 3 -hydroxyacyl-CoA dehydrogenase (hbd-8) homolog] 
[OR:Archaeoglobus fulgidus] [DB:pir2] >gp : [GI : g2649379] [LN : AE001021] 
[AC:AE001021:AE000782] [ PN : 3 - hydroxyacyl - CoA dehydrogenase (hbd-8)] [GN:AF1206] 
[ OR : Ar chaeog lobus fulgidus] [DB : genpept -bct2 ] [DE : Archaeoglobus fulgidus section 86 of 

172 of the complete genome.] [NT: similar to PID: 1055222 SP:P52041 percent identity:] 
[LE:9126] [RE: 10073] [DI : complement] 

NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 



Ar7503000984_4884812J:3_3i4 | [759 | [ 4531 | pFT| | 4 18 | | 579 || l.ie-78 



Description 

gp: [GI :el358508 :g3980137] [LN : LM034616] [AC : AJ007319] [PN : succinyl -diaminopimelate 
desuccinylase] [GN: dapE] [OR: Listeria monocytogenes] [DB : genpept-bctl] [DE: Listeria 
monocytogenes ascB, inlG, inlH, inlE, dapE genes.] [LE:5533] [RE:6672] [DI:direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

p7503000984_4895061_c2_561 | [760 | [^T2~[pT7T] | V56 | [2671 ||5TB e-278 | 

Description ~ — — 

pir: [LN:S01788] [AC : S01788 : B323 05 : F64829 ] [PN:formate C-acetyltransf erase , l:pyruvate 
formate-lyase I] [GN:pflB:pfl ] [CL:formate C-acetyltransf erase l:glycyl radical 
homology] [OR : Escherichia coli] [EC : 2 . 3 . 1 . 54] [DB:pirl] [MP: 20. 5 ] 
>gp: [GI:dl036624 :gl651427] [LN:D90728] [AC : D90728 : AB001340] [PN:Formate 
c-acetyltransferase (EC 2.3.1.54).] [GN:pfl] [OR : Escherichia coli] [SR : Escherichia 
coli (strain:K12) DNA, clone :Kohara clone #216] [DB : genpept-bctl] [DE : Escherichia coli 
genomic DNA. (20.4 - 20.8 min) . ] [NT:ORF_ID:o216#7; similar to PIR Accession Number] 
[LE:6965] [RE:9247] [DI : complement ] >gp : [GI : g42370] [LN : ECPFL] [AC:X08035] 
[OR Escherichia coli] [DB : genpept-bctl] [DE : E . coli pfl gene for pyruvate formate-lyase 
(EC 2.3.1.54).] [NT:pyruvate formate-lyase (AA 1-760)] [SP:P09373] [LE:101] [RE.-2383] 
[DI: direct] >gp: [GI :gl787131] [LN:AE000192] [AC : AE000192 : U00096] [PN:formate 
acetyltransferase 1] [GN :pf IB] [FN:enzyme; Energy metabolism, carbon: Anaerobic] 
[OR : Escherichia coli] [DB : genpept -bct2] [EC : 2 . 3 . 1 . 54] [DE : Escherichia coli K-12 MG1655 
section 82 of 400 of the completegenome . ] [NT:f760; 100 pet identical to PFLB_ECOLI SW: 
P09373] [LE:5588] [RE: 7870] [DI : complement] 



NT AA 

ORF Name NTID AAID — , — , , score probabil ity 
LengthLength * JL 

|AI7503000984_497409i c2 559 ] [7^1 1 



TFT 



0.00087 



Description 

pir: [LN:B71359] [AC:B71359 ] [PN : conserved hypothetical protein TP0156] [GN:TP0156 ] 
[OR:Treponema pallidum subsp. pallidum] [SR:, syphilis spirochete] [DB:pir2] 
>gp: [GI:g3322423] [LN : AE001200] [AC : AE001200 : AE000520] [PN : conserved hypothetical 
protein] [GN:TP0156] [OR : Treponema pallidum] [DB : genpept -bet 2] [DE : Treponema pallidum 
section 16 of 87 of the complete genome.] [NT: similar to GB:L42023 SP:P44679 
PID: 1003656] [LE:2984] [RE: 3388] [DI : complement] 



239 



ORF Name 



NT ID 



„ T „ NT AA 

AAID — . , — . . scor e 

LengthLengbh 



probability 



1AI7503000984 b0b301 E3 345 



Description 
NO-HIT 



J |762 j [4534 ||291 1 196 | 



ORF Name 



NT ID AAID 



NT 



AA 



AI7503000984_5109785_c3_670 



Length Length 



score 



] I 763 | [4535 ||255 ||84 | [75 



probability 
[uTuTTS 



Description 

gp: [GI:g4103231] [LN: AF021085] [AC : AF021085] [PN : cytochrome b] [GN:cytb] 
[OR: Mitochondrion Edaphus sp.] [SRrEdaphus sp] [DB : genpept - inv2 ] [DE:Edaphus sp . 
cytochrome b (cytb) gene, mitochondrial gene encodingmitochondrial protein, partial 
cds.] [LE:<1] [RE:>465] [DI: direct] 



ORF Name 



NT 



AA 



NT ID AAID , _ , 

LengthLength 



score 



1AI7503000984 5113413 c3 653 



][ 



[4535 |[^~| piT" 



probability 
j |3. Je-b6 



Description 

pir: [LN:D69670] [AC:D69670 ] [PN:glycine betaine/carnitine/choline ABC transporter 
(membrane p) opuCB] [GNropuCB ] [CLiglycine betaine/carnitine/choline ABC transporter] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI :g2271390] [LN: AF009352] [AC : AF009352] 
[PN: transmembrane protein] [GN:opuCB] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:Bacillus subtilis osmoprotectant transport system OpuC includingATPase (opuCA) , 
transmembrane protein (opuCB) , osmoprotectantbinding protein precursor (opuCC) and 
transmembrane protein (opuCD) genes , complete cds.] [NT: OpuCB; part of the 
osmoprotectant transport system] [LE:2025] [RE: 2678] [DI:direct] 
>gp: [GI:ell86070:g2635895] [LN : BSUB0018] [AC : Z99121 : AL009126] [PN:glycine 
betaine/carnitine/choline ABC] [GN:opuCB] [FN: high affinity transport of glycine 
betaine,] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 18 of 21) : from 3399551to 3609060.] [NT : alternate gene name: yvbD] 
[LE: 68697] [RE: 69350] [DI : complement] 



ORF Name 



NT ID 



„™ NT AA 

AAID — — score 

LengthLength 



probability 



1A17503000984 516092b cl 443 



Description 
NO-HIT 



] | 765 1 [4537 |p^~ || 63 | 



ORF Name 



NTID 



7V7VTr , NT AA 

AAID — — . score 

LengthLength 



IAI7503000984 5189037 c3 620 



7TT 



T5T" 



probability 
4.7e-32 



Description 

pir: [LN:H64461] [AC:H64461 ] [PN : 6-carboxyhexanoate- -CoA ligase , ] 

[CL: 6-carboxyhexanoate- -CoA ligase bioW] [OR:Methanococcus jannaschii] [EC : 6 . 2 . 1 . 14] 
[DB:pir2] [MP:FOR1244632-1245345 ] >gp : [GI : gl591935] [LN:U67570] [AC :U67570 : L77117] 
[PN:6-carboxyhexanoate-CoA ligase (bioW) ] [GN:MJ1297] [OR : Methanococcus jannaschii] 
[DB: genpept -bct2] [DE : Methanococcus jannaschii section 112 of 150 of the complete 
genome.] [NT:similar to GB:M29291 SP:P22822 PID:142594 percent] [LE:6098] [RE:6811] 
[DI:direct] 



ORF Name 



I A1 7 503000984 5860630 c2 564 



NTID 

1 FZ~ 



„ TO NT AA 

AAID — — score 

LengthLength 



probability 



Description 
NO-HIT 



[4539 |[X7T | |56 | 
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ORF Name 



U.7503000984 60bfebfeV cl 490 



NT AA 

™ ^ LengthLength probability 
] | 768 1 | 4540 |pT"]|192 | p20 || 9.1e-29 



Description 

pir: [LN:D64866] [AC:D64866 ] [PN : hypothetical protein bll99] [OR : Escherichia coli] 
[DB:pir2] >gp : [GI :gl787449] [LN : AE000218] [AC : AE000218 : U00096 ] [PNiputative 

dihydroxyacetone kinase (EC 2.7.1.2)] [GN:bll99] [FN: putative enzyme; Not classified] 
[OR: Escherichia coli] [DB :genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 108 of 

400 of the completegenome . ] [NT:f210; 30 pet identical (16 gaps) to 181 residues] 
[LE:4528] [RE: 5160] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



,. — m _ — .score probabi lity 
LengthLength c JL 



IAI7503000984 S25262 cl 430 



] | | 4541 |pS~|pg |p j|0.0i3 



Description 

gp: [GI:g765037] [LN : DROMTTRND] [AC:M18022] [PN:NADH dehydrogenase subunit 2] 
[OR: Mitochondrion Drosophila melanogaster] [SR: fruit fly] [DB : genpept - inv2 ] 
[DE:D.melanogaster Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase subunit 2 
cytochrome oxidase subunit 1 (5' end) genes.] [LE:<1] [RE: 462] [DI: direct] 



(3» end) 



ORF Name 



NTID 



NT 



AA 



AAID . . , . 



LengthLength 



score 



probability 



IAI7503000984 5302217 c3 638 



] |770 | [4b42 |p3~ ||315 ||133B ||2 ■ Se-136 



Description 

gp: [GI:g3800819] [LN: AF076683] [AC : AF076683 ] [PN : oligopeptide transporter putative 
membrane] [GN:opp-lB] [OR : Staphylococcus aureus] [DB : genpept -bct2] [DE : Staphylococcus 
aureus oligopeptide transporter putative substratebinding domain (opp-lA) , oligopeptide 
transporter putative membranepermease domain (opp-lB) , oligopeptide transporter 
putativemembrane permease domain (opp-lC) , oligopeptide transporterputative ATPase 
domain (opp-lD) , and oligopeptide transporterputative ATPase domain (opp-lF) genes, 
complete cds; and unknowngene . ] [LE:1537] [RE:2472] [DI:direct] 



ORF Name 



_ _ _ _ _ _^ NT AA 

NTID AAID — — , _ score 
LengthLength 



probability 



IAI7503000984 6725817 c3 658 



] |77i | [4543 ||165~| p 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength ' 



V17503000984 7072825 cl 435 



][ 



HIT 



score probability 
[4544 |[T7^-||257 ||"459 ||i.7e-43 



Description 

gp: [GI:g4980796] [LN: AE001711] [AC : AE001711 : AE000512] [PN: oxidoreductase , short chain] 
[GN:TM0297] [OR : Thermo toga maritima] [DB : genpept -bet 2] [DE : Thermotoga maritima section 
23 of 136 of the complete genome.] [NT: similar to SP:P50167 PID: 763164 percent 
identity:] [LE:7161] [RE: 7934] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — „ — _ score 
LengthLength 



probability 



IAI7503000984 800300 cl 458 



] I 773 I [4545 |p3— 1| 194 |pT 3 [ |5.0e-28 



Description 

pir: [LN:S23693] [AC:S23693 ] [PN : erythrocyte membrane-associated antigen (clone pPf 
K19)] [OR: Plasmodium falciparum] [DB:pir2] 



241 



ORF Name 



NT AA 
NTID AAID — « _ — , . score 
LengthLength 



probability 



] |7 74 | [4546 |473 |[1U8 ||i.3e-TTT" 



|AI7B030009B4JU414U_cl_4J7 

Description 

sp: [LN:UHPT_ECOLI] [AC : P13408 : P76727] [GN : UHPT] [OR : ESCHERICHIA COLI] [DE : HEXOSE 
PHOSPHATE TRANSPORT PROTEIN] [SP : P13408 : P7672 7] [DB : swissprot] >pir : [LN : MMECHP] 
[AC:A30395:H41853 :C65168 : Q00500 : S3 007 9 ] [PN:hexose phosphate transport protein uhpT] 
[GN : uhpT ] [CL:hexose phosphate transport protein uhpT] [OR : Escherichia coli] [DBrpirl] 
[MP:82 min ] >gp : [GI : gl48115] [LN:ECOUHP] [AC:M17102] [OR : Escherichia coli] [SR:E.coli 
DNA, clone pRJKlO] [DB : genpept-bctl] [DErE.coli uhp operon encoding UhpA, UhpB, UhpC, 
and UhpT protein, (encoding hexose phosphate transport protein) , complete cds, and 
anilvBN operon encoded protein, 3' end.] [NTrhexose phosphate transport protein UhpT] 
[LE:3722] [RE:5113] [DI:direct] >gp : [GI : gl4812 0] [LN: ECOUHPABCT] [AC:M89479] 
[PN:hexosephosphate transport protein] [GNiuhpT] [OR : Escherichia coli] [SR : Escherichia 
coli DNA] [DB:genpept-bctl] [DE : Escherichia coli uhpABCT operon encoding 
hexosephosphateutilization protein (uhpA) gene, complete cds, and 

hexosephosphatetransport protein (uhpB, uhpC, uhpT) genes, complete cds.] [LE:3722] 
[RE:5113] [Dlidirect] >gp : [GI :g2367259] [LN : AE000444] [AC : AE000444 :U00096] [PN:hexose 

phosphate transport protein] [GNiuhpT] [FN : transport / Transport of small molecules:] 
[OR Escherichia coli] [DB :genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 334 of 

400 of the completegenome . ] [NT: f 463; 99 pet identical amino acid sequence and] 
[LE: 5590] [RE: 6981] [DI : complement] 



ORF Name 



IAI7503000984 51525 t3 398 



Description 
NO-HIT 



NTID 



AAID 



NT 



AA 



Length Length 



score 



probability 



] I 775 1 1 4547 lP b HI 44 I 



ORF Name 



E 



17503000984 816878 c3 637 



NT AA 

NTID AAID — , , score 

LengthLength 



probability 



] | 775 1 |4548 |[T^?T]|543 | p^F]|l . Oe-225 



] 



Description 

gp: [GI:g3800818] [LN : AF076683] [AC : AF076683] [PN : oligopeptide transporter putative 
substrate] [GN:opp-lA] [OR: Staphylococcus aureus] [DB : genpept -bct2 ] [DE : Staphylococcus 
aureus oligopeptide transporter putative substratebinding domain (opp-lA) , oligopeptide 
transporter putative membranepermease domain (opp-lB) , oligopeptide transporter 
putativemembrane permease domain (opp-lC) , oligopeptide transporterputative ATPase 
domain (opp-lD) , and oligopeptide transporterputative ATPase domain (opp-lF) genes, 
complete cds; and unknowngene . ] [LE:64] [RE: 1524] [DI: direct] 



ORF Name 



UE7S03000984 822150 T2 262 



NTID 



AAID . — . 



NT AA 
LengthLength 
] |777 | [4549 |[FT5~| [TTT 



score probability 
ET5 110.028 



Description 

gp: [GI:dl006984:g567946] [LN: PXMTGBP] [AC:D30753] [PN:21K protein] [FN: Presumably cell 
to cell movement] [OR:Potato mop-top virus] [SR:Potato mop-top virus 

(individual_isolate Todd) (library: Tb2-1] [DB .-genpept -vrl] [DE:Potato mop-top virus 
RNA for 51K protein, 13K protein, 21K proteinand 8K protein, complete cds.] 
[NT:putative] [LE:1961] [RE:2533] [DI:direct] 



ORF Name 



AI7503000984_87b765_c2_525 

Description 

NO-HIT 



NTID 



NT AA 
AAID — , — , , score 
LengthLength 



probability 



778" 
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NT AA 

QRF Name HTID AAID Le — thLe ~ th sc °re probability 



AI7B03000984_9fc91i>7_clj4a0 | |779 | [4551 ||1026 1 |341 || 220 | | 8.5e-lg 



Description 

gp: [GI:g3800818] [LN : AF076683 ] [AC : AF076683] [PN : oligopeptide transporter putative 
substrate] [GN:opp-lA] [OR : Staphylococcus aureus] [DB : genpept-bct2] [DE : Staphylococcus 
aureus oligopeptide transporter putative substratebinding domain (opp-lA) , oligopeptide 
transporter putative membranepermease domain (opp-lB) , oligopeptide transporter 
putativemembrane permease domain (opp-lC) , oligopeptide transporterputative ATPase 
domain (opp-lD) , and oligopeptide transporterputative ATPase domain (opp-lF) genes, 
complete cds; and unknowngene . ] [LE:64] [RE:1524] [DI:direct] 



NT AA 

QRF Name NTID AAID — _ T — _ score probability 
LengthLength c JL 



AI7503000384_98067i8_ci_474 | |780 | [4552 |[7S2~| p53 | |3.4e-86 



Description 

gp: [GI:g3800822] [LN : AF076683 ] [AC : AF076683] [PN : oligopeptide transporterputative 
ATPase domain] [GN:opp-lF] [OR : Staphylococcus aureus] [DB : genpept-bct2] 
[DE: Staphylococcus aureus oligopeptide transporter putative substratebinding domain 
(opp-lA) , oligopeptide transporter putative membranepermease domain (opp-lB) , 
oligopeptide transporter putativemembrane permease domain (opp-lC) , oligopeptide 
transporterputative ATPase domain (opp-lD) , and oligopeptide transporterputative ATPase 
domain (opp-lF) genes, complete cds; and unknowngene . ] [LE:4137] [RE:4892] [DI:direct] 

nam— — »™m°o< - ■ -» '■'■'■ ■ ■ ■ ■ ■ »■ ■ 11111111 ■ ■■■iiM» l i"i n i l ifc — ■■■■"■ ■ii p iiiiiiiiiiiii i ■ !■ t i tt ^ i iMa i amM^ i^^^m i ii i i m i i i i i^^i m^^^^^as imm—— 

NT AA 

QRF Name NTID AAID — — . , score probab ility 
LengthLength c L - 



AI7503000984_984g86_ci_478 | |781 | [4553 |[mT] [405 | |i!72 ||4 . 8e-119 



Description 

pir : [LN:C69670] [AC:C69670 ] [PN:glycine betaine/carnitine/choline ABC transporter 
(ATP-bindin) opuCA] [GN:opuCA ] [CL: glycine betaine /proline transport protein proV: 
ATP-binding cassette homology: CBS homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:g2271389] [LN : AF009352] [AC : AF009352] [PN : ATPase] [GN:opuCA] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE: Bacillus subtilis osmoprotectant transport system OpuC 
includingATPase (opuCA) , transmembrane protein (opuCB) , osmoprotectantbinding protein 
precursor (opuCC) and transmembrane protein (opuCD) genes , complete cds.] [NT: OpuCA; 
part of the osmoprotectant transport system] [LE:860] [RE: 2002] [DI : direct] 
>gp: [GI:ell86071:g2635896] [LN : BSUB0018 ] [AC : Z99121 : AL009126] [PN:glycine 
betaine/carnitine/choline ABC] [GN:opuCA] [FN: high affinity transport of glycine 
betaine,] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 18 of 21): from 3399551to 3609060.] [NT : alternate gene name: yvbE] 
[LE : 69373] [RE : 70515] [DI : complement] 



NT AA 

QRF Name NTID AAID — — t , score probability 
LengthLength 



AI7B03000984_9970i67J:3_393 | |782 | [4bb4 || 861 ~| |286 | prg~ || 2 . 9e-25 



Description 

pir: [LN:E69796] [AC:E69796 ] [PN : two -component response regulator [YesM] homolog yesN] 
[GN : yesN ] [CL:response regulator homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell82675:g2633009] [LN:BSUB0004] [AC : Z99107 : AL009126] [GN:yesN] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
4 of 21): from 600701 to813890.] [NT:similar to two-component response regulator 
[YesM]] [LE: 159260] [RE: 160366] [DI: direct] 

NT AA 

QRF Name NTID AAID — - T — , , score probability 
Length Length c 

[AI7503000984_9973515_cl_495 | |783 | [ 4555 | | 2bl 1 | 86 | 

Description 
NO-HIT 



243 



ORF Name 



NT AA 
NTID AAID — ^ , _ — , score 
Length Length 



probability 



p750300098b_10000183_c3_2061 | |784 | [4556 |p^5 1 155 | 



Description 
NO-HIT 



ORF Name 



NTID 



MTr% NT AA 

AAID — i _, T — , . score 

LengthLength 



probability 



E 



7503000985 1003462V El 956 



4557 Ifm 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID AAID „ — , „ — m score probability 

LengthLength 



IAI7503000985 10195252 12 807 



][ 



[4558 | [ITS 1 141 | 



Description 
NO-HIT 



ORF Name 



IAI7503000985 10203501 c2 1757 



NTID 
j |787 



„ T1 ^ NT AA 

AAID — — score 

Length Length 

] [4555 |j!56 ||51 | 



probability 



Description 
NO-HIT 



ORF Name 



NTID 



E 



17503000985 10241433 Cl 1640 



][ 



7W 



NT AA 

AAID — 

LengthLength 

[4550 | [TZ5 | 



score 



TUT 



probability 
1.4e-05 



Description 

sp: [LN:YDGI_BACSU] [AC:P96707] [GN : YDGI] [OR:BACILLUS SUBTILIS] [EC : 1 . -.-.-] 
[DE : PUTATIVE NAD ( P ) H NITROREDUCTASE YDGI,] [SP:P96707] [DB : swissprot ] >pir : [LN : C69783 ] 
[AC:C69783 ] [ PN : NADH dehydrogenase homolog ydgl] [GNrydgl ] [CL : nitroreductase] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl020152 :gl881372] [LN : AB001488] [AC : AB001488] 
[GNrydgl] [OR:Bacillus subtilis] [SRrBacillus subtilis (strain: 168) DNA] 
[DB : genpept-bctl] [DE: Bacillus subtilis genome sequence, 148 kb sequence of the 
regionbetween 35 and 47 degree.] [NT : SIMILAR TO NITROREDUCTASE.] [LE:145410] 
[RE:146039] [DI : complement] >gp : [GI : ell82545 : g2632879] [LN: BSUB0004] 

[AC:Z99107 :AL009126] [GNrydgl] [FNrunknown] [ORrBacillus subtilis] [DB r genpept-bctl] 

[DErBacillus subtilis complete genome (section 4 of 21) r from 600701 to813890.] 

[NT r similar to NADH dehydrogenase] [SPrP96707] [LE: 11206] [RE : 11835] [D I r complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000985 10332262 tl 254 



] |789 | [4551 |p^~||121 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



, — . „ — _ score probability 
LengthLength c 



E 



17503000985 1046885 11 82 
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Description 

gpr [GIre245927rg2462047] [LNr ACRBDOXN] [AC:Z46863] [PN: polyphosphate kinase] [GNrppk] 
[ORr Acinetobacter sp. ADP1] [DB r genpept-bctl] [DE : Acinetobacter sp. cysD, cobQ, sodM, 
lysS, rubA, rubB, estB, oxyR,ppk, mtgA, ORF2 and ORF3 genes.] [NT : putative ; 
transcription of ppk is induced by] [LE:9244] [RE: 11319] [DI : complement] 



244 



ORF Name 



NT ID 



NT AA 
AAID — -i _ — score 
LengthLength 



probability 



[AI750300098b_10b8b4 32_cl_15 39 [ [791 [ [4553 |196 ||171 ||2 . ie-lB — 

Description 

Sp : [LN : GGT_BACSU] [AC:P54422] [GNrGGT] [OR:BACILLUS SUBTILIS] [EC:2.3.2.2] 
[DE : GAMMA-GLUTAMYLTRANSPEPTIDASE PRECURSOR, ] [SP:P54422] [DB : swissprot] 
>pir: [LN:F69631] [AC : F6 9631 : JC5867 : PC4504 ] [PN : gamma -glutamyl trans f erase , 
precursor : gamma glutamyl transpeptidase] [GN:ggt ] [CL : gamma -glutamyl transferase] 
[OR:Bacillus subtilis] [EC:2.3.2.2] [DB:pir2] >gp : [GI : ell83499 : g2634224] [LN:BSUB0010] 
[AC: Z99113 : AL009126] [PN : gamma -glutamyl transpeptidase] [GN:ggt] [FN : glutathione 
metabolism] [OR: Bacillus subtilis] [DB : genpept - be t 1 ] [EC: 2. 3. 2. 2] [DE: Bacillus subtilis 
complete genome (section 10 of 21): from 1781201to 2014980.] [NT : alternate gene name: 
pac] [SP:P54422] [LE:222733] [RE:224496] [DI:direct] >gp : [GI : ell85314 :g2634235] 
[LN:BSUB0011] [AC: Z99114 :AL009126] [PN : gamma -glutamyl transpeptidase] [GN:ggt] 
[FN glutathione metabolism] [OR:Bacillus subtilis] [DB :genpept-bctl] [EC:2.3.2.2] 
[DE:Bacillus subtilis complete genome (section 11 of 21) : from 2000171to 2207900.] 
[NT: alternate gene name: pac] [SP:P54422] [LE:3763] [RE: 5526] [DI: direct] 
>gp: [GI:gl491813] [LN : BSU49358] [AC:U49358] [PN : gamma-glutamyltranspeptidase] [GN:ggt] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis strain= JH642] [DB :genpept-bct2] 
[DE:Bacillus subtilis gamma-glutamyltranspeptidase (ggt) gene, completecds . ] [LE:250] 
[RE:2013] [Dlidirect] 



ORF Name 

IAI7503000985 10600010 c2 1931 



NT AA 
NT ID AAID ^ — _ T — , , score 
LengthLength 



probability 
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Description 
NO-HIT 



ORF Name 



NT AA 
NT ID AAID , — . — _ score 
LengthLength 



^17503000985 1060162b c3 2039 



] | 79 ^ | |4565 ||231~"] |7g 



probability 
0.0050 



] 



Description 

gp: [GI:el286089:g3036830] [LN : CJAJ0856] [AC : AJ000856] [PN:ABC transporter protein] 
[GN : kpsM] [OR Campylobacter jejuni] [DB : genpept -bet 1] [DE : Campylobacter jejuni kpsM, 
kpsT genes.] [LE:134] [RE: 916] [DI:direct] 



ORF Name 



NTID 



AA 



NT 

AAID — , — , score 
LengthLength " 



E 



17503000985 10605337 c2 1945 



probability 
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Description 

gp : [GI:g666983] [LN: BSPAAT] [AC:X77636] [PN:putative ATP binding subunit] [GN : ORF3 ] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis putative amino acid transporter 
gene.] [NT:potential ABC-transport system] [SP:P39456] [LE:1627] [RE:2370] [DI:direct] 



ORF Name 



^7503000985 1053552 t3 982 



Description 
NO-HIT 



NT AA 
NTID AAID „ — ^, , — ^, score 
LengthLength 



probability 
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Description 

NO-HIT 



NT AA 
NTID AAID , — L1 m — _ score 
LengthLength 



probability 
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ORF Name 



NTID 



NT AA 

^ Le^thLe^th ^^ probability 



IA17503000985 106b78L>7 cl 1607 



] |V97 | [4559 ||1401||466 | — 1| 1 . Ve-36 



Description 

gp: [GI:e304997:g2294506] [LN:A37836] [AC:A37836] [OR : Streptomyces pristinaespiralis] 
[DB:genpept-pat] [DE:Sequence 6 from Patent WO9408014.] [NT:unnamed protein product] 
[LE:103] [RE:1689] [Dlidirect] 



ORF Name 



IAI7503000985 10567002 c2 1553 



Description 
NO-HIT 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 
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Description 
NO-HIT 
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Description 
NO-HIT 



NTID 



NT AA 
AAID — — score 
LengthLength 

[4572 |P^~] |54 | 



probability 
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NT 
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LengthLength 
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Description 
NO-HIT 
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NTID 
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LengthLength 



score 



[AI750300098^ 1074177 c2 1838 



probability 
■J |802 | [4574 || 274 |[" 417 | | 4.8e-39 



Description 

pir: [LN:F64819] [AC:F64819 ] [PN: hypothetical protein b0822] [OR : Escherichia coli] 
[DB:pir2] >gp: [GI : dl036489 :g4062389] [LN:D90719] [AC :D90719 : ABO 01340] [PN : Hypothetical 

protein 1] [OR : Escherichia coli] [SR : Escherichia coli (strain : K12) DNA, clone :Kohara 

clone #206] [DB : genpept-bctl] [DE : Escherichia coli genomic DNA. (18.2 - 18.6 min) . ] 

[NT:ORF_ID:o207#5; similar to PIR Accession Number] [LE: 13745] [RE: 14560] 

[DI : complement] >gp: [GI : dl 036496 :g4 062396] [LN:D90720] [AC : D90720 : AB001340] 

[PN: Hypothetical protein 1] [OR : Escherichia coli] [SR : Escherichia coli (strain : K12 ) DNA, 

clone:Kohara clone #207] [DB : genpept-bctl] [DE : Escherichia coli genomic DNA. (18.4 - 

18.8 min).] [NT : ORF_ID : o207#5 ; similar to PIR Accession Number] [LE:5774] [RE:6589] 
[DI: complement] >gp: [GI :gl787043] [LN: AE000184] [AC : AE000184 :U00096] [PN:orf, 

hypothetical protein] [GN:b0822] [FN:orf; Unknown] [OR : Escherichia coli] 
[DB:genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 74 of 400 of the 

completegenome. ] [NT:f271; This 271 aa ORF is 24 pet identical (16 gaps)] [LE:6609] 
[RE : 7424] [DI : complement] 
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ORF Name 



NTID 



NT AA 
AAID — •« _ — score 
LengthLength 



probability 



IAI7503000985 107bfe925 tl 225 



803 | [4575 |preT||52i | [753~ ||6 . 9e-79 



Description 

gp: [GI:dl020925:g2116759] [LN:D86418] [AC.-D86418] [PNrYfnA] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain :AC327) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis 
genomic DNA 69-70 degree region, partialsequence . ] [LE:7539] [RE:8927] [DIrdirect] 



ORF Name 



F 1 



750300098b 10538503 ti 322 



Description 
NO-HIT 
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Description 
NO-HIT 
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AAID — — , , score 
LengthLength 



probability 
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NTID AAID — _ T — , , score 
LengthLength 
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E 
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Description 

gp: [GI :g4731918] [LN : AF111944] [AC : AF111944] [PN : development protein DG1122] 
[GN:DG1122] [OR : Dictyostelium discoideum] [DB : genpept - inv2 ] [DE : Dictyostelium 

discoideum AX4 development protein DG1122 (DG1122) gene, partial cds . ] [LE: 207: 829] 
[RE : 744 : >1036] [DI : direct Join] 



ORF Name 



117503000985 11053801 tl 381 



Description 
NO-HIT 



NT AA 

NTID AAID — — , , score probability 
LengthLength 
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Description 
NO-HIT 



NTID 
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LengthLength c 
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NT 
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NTID AAID — — , , score p robability 
Lengtn Length 



^7503000985 115751 c3 2118 
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Description 

pir: [LN:S77243] [AC:S77243 ] [PN : hypothetical protein slrl363] [OR : Synechocystis sp.] 
[SR:PCC 6803, , PCC 6803] [SR:PCC 6803, ] [DB:pir2] >gp : [GI : dl018310 :gl652657] 
[LN:D90907] [AC : D90907 : AB00133 9] [PN : hypothetical protein] [GN:glgP] [OR : Synechocystis 
sp.] [SR: Synechocystis sp. (strain : PCC6803) DNA] [DB : genpept-bctl] [DE : Synechocystis 
sp. PCC6803 complete genome, 9/27, 1056467-1188885.] [NT : ORF_ID : slrl363] [LE:49200] 
[RE : 50702] [DI : direct] 
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ORF Name 



NT 



AA 



3^5 ^ LengthLength probability 



l7b030009B5 liVlbO ill 173 



■j |810 | [4582 |piT]|435 | |IgZr|| i, 9e-lB g 



Description 

gp: [GI:g4096796] [LN : SCU40157] [AC:U40157] [OR : Staphylococcus carnosus] 
[DB:genpept-bct2] [DE : Staphylococcus carnosus condensing- enzyme- like protein (orfl) 
andSpoVE-like protein (orf2) genes, complete cds . ] [NT:orfl; unknown function; similar 
to] [LE:193] [RE:1362] [DIrdirect] 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



IAI7503000985 1173130 ci 1530 
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Description 

pir: [LN:S46952] [AC : S46 952 : S63605 ] [PN : phosphotransferase system enzyme II, , 
glucose- specif ic, factor IIA: glucose permease :phosphoenolpyruvate : glucose 
phosphotransferase system enzyme II, glucose- specif ic : phosphotransferase system enzyme 
II, glucose-specific, factor 1 :protein-Npi-phosphohistidine- -sugar phosphotransferase, 
glucose-specific, factor II] [GN : glcA : ptsG ] [CL : phosphotransferase system 
N-acetylglucosamine- specif ic enzyme II : phosphotransferase system glucose-specific 
enzyme II, factor II homology : phosphotransferase system glucose-specific enzyme II, 
factor III homology] [OR : Staphylococcus carnosus] [EC : 2 . 7 . 1 . 69] [DB:pirl] 
>gp: [GI:gl072418] [LN : SCGLCAB] [AC : X93360 : X80415] [GNrglcA] [OR : Staphylococcus 
carnosus] [DB : genpept-bctl] [DE : S . carnosus glcA gene and glcB gene.] [LE:540] [RE: 2567] 
[DI: direct] 



ORF Name 



NTID 



AAID 



NT 
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LengthLength probability 



VI750300098b 11756543 i2 497 
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Description 
NO-HIT 
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LengthLength 



score 



17503000985 1178593 t3 990 
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probability 
i.ie-156 



Description 

pir: [LN:C6 9794] [AC:C69794 ] [PN : glutamate synthase (ferredoxin) homolog yerD] [GN:yerD 
] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82639 :g2632973] [LN : BSUB0004] 
[AC:Z99107 :AL009126] [GN:yerD] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 4 of 21): from 600701 to813890.] 
[NT:similar to glutamate synthase (ferredoxin)] [LE:115586] [RE:117163] [DI : complement] 
>gp: [GI:ell67974:g2577963] [LN : BSYERABCD] [AC:Y15254] [PN: YerD protein] [GN:yerD] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 13kB DNA fragment, from 
yerA to sapB gene.] [NT: similar to plectonema boryanum large subunit of] [LE:3231] 
[RE: 4808] [DI : complement ] 



ORF Name 



NTID 



^ m NT AA 

AAID — , — , , score 

LengthLength 



probability 



IA17503000985 1180292 ci 1535 
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Description 

pir: [LN:F70069] [AC:F70069 ] [PN:capsular polyglutamate biosynthesis homolog ywsC] 
[GN:ywsC ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84496 :g2636115] [LN:BSUB0019] 
[AC:299122 :AL009126] [GN:ywsC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT: similar to capsular polyglutamate biosynthesis] [LE: 101413] [RE: 102594] 
[DI: complement] >gp: [GI : ell84496 :g2636115] [LN: BSUB0019] [AC : Z99122 : AL009126] [GNrywsC] 
[FN:unknown] [OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT:similar to capsular polyglutamate 

biosynthesis] [LE : 101413] [RE : 102594] [DI : complement] 
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ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000985 liybb463 c3 2075 
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Description 

pir: [LN:H69611] [AC:H69611 ] [PN : 3 ' -phosphoadenosine 5 ' -phosphosulf ate reductase, 
cysH:3 ' -phosphoadenylylsulf ate reductase, thioredoxin dependent: PAPS reductase : PAPS 
sulfotransf erase] [GNicysH ] [OR:Bacillus subtilis] [EC : 1 . 8 . 99 . 4] [DB:pir2] 
>gp: [GI:e332183:g2462956] [LN: BSPYREYLO] [AC : AJ000974] [PN:putative 
phospho-adenylylsulphate] [GN : cysH] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE: Bacillus subtilis pyrE to yloA gene region.] [LE:548] [RE: 1249] [DI: direct] 
>gp: [GI:ell85149:g2633930] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN : phosphoadenosine 
phosphosulf ate] [GN : cysH] [FN: cysteine biosynthesis] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [EC : 1 . 8 . 99 . 4] [DE:Bacillus subtilis complete genome (section 9 of 
21): from 1598421to 1807200.] [LE:31361] [RE:32062] [DI: direct] 



ORF Name 



NT ID 



NT AA 
AAID — , _ — , - score 
LengthLength 



probability 



IAI7503000985 11881630 ±2 658 
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Description 
NO-HIT 
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NTID 
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Description 

sp: [LN:HIS2_HAEIN] [AC:P44434] [GN : HISI : HISIE : HI04 75] [OR : HAEMOPHILUS INFLUENZAE] 
[EC : 3 . 5 . 4 . 19 : 3 . 6 . 1 . 31] [DE : PYROPHOSPHOHYDROLASE , ] [SP : P44434] [DB : swissprot] 
>pir: [LN:A64071] [AC:A64071 ] [PN : phosphor ibosyl -AMP cyclohydrolase , / 
phosphor ibosyl -ATP pyrophosphatase,] [CL:hisI bifunctional enzyme:hisl bifunctional 
enzyme homology : hisl protein homology] [OR : Haemophilus influenzae] 

[EC:3. 5. 4. 19:3-6. 1.31] [DB:pir2] >gp: [GI :gl573454] [LN:U32730] [AC : U32 73 0 : L42023 ] 
[PN:phosphoribosyl-AMP cyclohydrolase /] [GN:HI0475] [OR : Haemophilus influenzae Rd] 
[DB:genpept-bct2] [DE Haemophilus influenzae Rd section 45 of 163 of the complete 
genome.] [NT:similar to SP:P06989 GB:D43637 GB:U02072 GB:X03974] [LE:4576] [RE:5241] 
[DI: direct] 



ORF Name 



NT AA 
NTID AAID , — _ — __ score 
LengthLength 



probability 
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Description 

NO-HIT 
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NT AA , , . , . 

0RF Name HTID AAI5 LemthLe^th ^^ probability 
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Description 

pir : [LN:B69680] [AC :B6 96 80:140510 ] [PN : para-nitrobenzyl esterase,] [GN:pnbA ] 
[CLrcholinesterase homology] [OR: Bacillus subtilis] [EC: 3. 1.1.-] [DB:pir2] 

>gp: [GI:gl762126] [LN : BSU46134] [AC:U46134] [PN : intracellular esterase B] [GN : estB] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [EC: 3. 1.1.1] [DE: Bacillus subtilis putative 

orf 1 unknown protein, putativetranscriptional regulator (sir) , and intracellular 

esterase B(estB) genes, complete cds . ] [NT: EstB; esterase of the serine -hydrolase 

family] [LE:1035] [RE:2504] [DI: direct] >gp : [GI : ell86127 : g2635952] [LN: BSUB0018] 
[AC:Z99121 :AL009126] [PN : para-nitrobenzyl esterase (intracellular] [GN:pnbA] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC:3. 1.1.1] [DE:Bacillus subtilis complete 

genome (section 18 of 21): from 3399551to 3609060.] [NT : alternate gene name: estB] 
[LE:130145] [RE:131614] [DI: direct] >gp : [GI :e2 38702 :gl495277] [LN : BSYVEFGNS] 
[AC:Z71928] [PN: para-nitrobenzyl esterase] [GN : pnbA] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE :B. subtilis pnbA, sigL, yve [ J, K, L, M, N, O, P, Q, R, S , T] 

andyvf [A, B, C, D, E, F, G, H] genes.] [LE:132] [RE: 1601] [DI : complement] 

>gp: [GI :e313129 :gl945688] [LN : BSZ94043] [AC:Z94043] [PN : para-nitrobenzyl esterase] 
[GN:pnbA] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis genomic DNA fragment 
(88 kb).] [LE:50057] [RE:51526] [DI : complement] 



NT AA 

ORF Name NT ID AAID — — , . score probability 
LengthLength c L 
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Description 

sp: [LN:HIS1_LACLA] [AC:Q02129] [GN:HISG] [OR : LACTOCOCCUS LACTIS] 
[SR: , SUBSPLACTIS : STREPTOCOCCUS LACTIS] [EC : 2 . 4 . 2 . 17] [DE : ATP 

PHOSPHORIBOSYLTRANSFERASE, ] [SP:Q02129] [DB : swissprot] >pir : [LN : D45734] [AC:D45734 ] 
[PN :HisG] [OR:Lactococcus lactis subsp. lactis] [DB:pir2] >gp : [GI : g2565141] 
[LN:LLU92974] [AC :U92974 :M90760 :M90761] [PN:HisG] [GN: hisG] [OR : Lactococcus lactis] 
[DB : genpept-bctl] [DE : Lactococcus lactis unknown gene, partial cds, and HisC 
(hisC) , unknown, HisG (hisG) , unknown, HisB (hisB) , unknown, HisH (hish) ,HisA (hisA) , 
HisF (hisF) , HisIE (hisIE) , unknown, unknown, LeuA(leuA), LeuB (leuB) , LeuC (leuC) , 
LeuD (leuD) , unknown, IlvD(ilvD), IlvB (ilvB) , IlvN, IlvC (ilvC) , IlvA (ilvA) , AldB 
(aldB)and aldR (aldR) genes, complete cds.] [NT : phosphoribosyl -ATP synthetase] 
[LE:3125] [RE:3751] [DI:direct] 



NT AA 

ORF Name NT ID AAID — ^, T — , _ score probability 
LengthLength c 
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Description 

gp: [GI:g4433636] [LN: AF029224] [AC : AF029224 : AF029225] [PN:NirC] [GN:nirC] [FNrputative 
nitrite transporter] [OR : Staphylococcus carnosus] [DB :genpept-bct2] [DE : Staphylococcus 
carnosus nir and nar operons, complete sequences.] [LE:226] [RE: 1056] [DI:direct] 



250 



ORF Name 



NT AA 
NT ID AAID — , — scor e 
LengthLength 



probability 



IA1 7 503000985 1230437 cl 1528 



] |822 | [ 4594 || 408 ~||135 | p2~ || 2 . 9e-16 



Description 

sp: [LN:YWBH__BACSU] [AC : P3 9591] [GN : YWBH : IPA-23R] [OR : BACILLUS SUBTILIS] 
[DE: HYPOTHETICAL 14.3 KD PROTEIN IN EPR-GALK INTERGENIC REGION] [SP:P39591] 
[DBrswissprot] >pir : [LN: S39678] [AC : S39678 : F70051 ] [PN:ywbH protein: hypothetical 
protein ipa-23r] [GNrywbH ] [CL : conserved hypothetical protein HI1297] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : g413947] [LN: BSGENR] [AC:X73124] [GN:ipa-23r] [OR:Bacillus 
subtilis] [DB :genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] [SP:P39591] 
[LE:23968] [RE:24354] [D I : complement] >gp: [GI :ell86331:g2 636367] [LN: BSUB0020] 
[AC:Z99123:AL009126] [GN : ywbH] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 20 of 21): from 3798401to 4010550.] 
[NT:alternate gene name: ipa-23r] [SP:P39591] [LE:133605] [RE:133991] [DI:direct] 



ORF Name 



IAI7503000985 12600305 tl 260 



Description 
NO-HIT 

ORF Name 



E 



17503000985 12590706 c3 2230 



Description 
NO-HIT 



NT ID AAID 



NT AA 
LengthLength 

| m 



score 



probability 



NT AA 
NT ID AAID — _ , — __ score 
LengthLength 



probability 



][ 



[4596 |P^~| 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



[AI7503000985 12697136 t2 862 



] P 5 | j 4557 |p3~|p90 ||1519 ||8.ie-156 



Description 

pir: [LN:S77609] [AC:S77609 ] [PN:probable intercellular adhesion protein B 

precursor :icaB protein] [GN: icaB ] [OR : Staphylococcus epidermidis] [DB:pir2] 

>gp: [GI:gll61381] [LN: SEU43366] [AC:U43366] [PN:IcaB] [GN : icaB] [OR : Staphylococcus 

epidermidis] [DB :genpept-bct2] [DE : Staphylococcus epidermidis operon mediating 

intercellular adhesion : IcaR, IcaA, IcaD, IcaB, and IcaC genes, complete cds . ] [LE:2265] 

[RE:3134] [DI :direct] 



ORF Name 



NT 



AA 



NTID AAID — — , , score pro bability 

LengthLength c L - 



IA17503000985 12698410 t3 1111 



Description 
NO-HIT 



ORF Name 



][ 



[4598 [|150 | 



NT AA 
NTID AAID score 
LengthLength 



probability 



A17b03000985_127002_tl_259 

Description 

NO-HIT 



j [827 | [4599 |p^~| |53 | 



251 



ORF Name 

1A17503000985 1291627 c3 2201 



NT ID 



NT 



AA 



AAID , L , „ 



Length Length 



score 



probability 
"] |828 | [4500 |pgT]|430 | pTTJT]| 4 . 7e-li2 



Description 

pir: [LN:B69876] [AC:B69876 ] [PN : acetylornithine deacetylase homolog ylmB] [GNrylmB ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85127 : g2633908] [LN: BSUB0009] 
[AC : Z99112 : AL009126] [GNrylmB] [FN:unknown] [ORrBacillus subtilis] [DB : genpept -bet 1 ] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:similar to acetylornithine deacetylase] [LE:8531] [RE:9811] [DIrdirect] 



ORF Name 



NTID 



NT AA 
AAID — _ — , , score 
LengthLength 



[AI7503000985 129578 ti 53 



] |825 | |460i |p^TF]|47i |p?y 



probability 
2 . 5e-129 



Description 

sp: [LN:YDGF_BACSU] [AC:P96704] [GNrYDGF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
TRANSPORT PROTEIN IN EXPZ-DINB INTERGENIC REGION] [SP:P96704] [DB : swissprot] 
>pir: [LN:H69782] [AC:H69782 ] [PN:amino acid ABC transporter (permease) homolog ydgF] 
[GN:ydgF ] [CL:arginine permease] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl020148:gl881368] [LN : AB001488] [AC : AB001488] [GN:ydgF] [OR:Bacillus subtilis] 
[SRrBacillus subtilis (strain:168) DNA] [DB : genpept -bet 1] [DE:Bacillus subtilis genome 
sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] [NT : PROBABLE AMINO 
ACID TRANSPORT PERMIASE . ] [LE: 139917] [RE: 141293] [D I : complement ] 

>gp: [GI:ell82528:g2632862] [LN : BSUB0003 ] [AC : Z99106 : AL009126] [GN:ydgF] [FNrunknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
3 of 21): from 402751 to611850.] [NT:similar to amino acid ABC transporter (permease)] 
[SP:P96704] [LE:203663] [RE:205039] [DI : complement ] >gp : [GI :ell82 54 l:g2 632875] 
[LN:BSUB0004] [AC : Z99107 : AL009126] [GN:ydgF] [FN:unknown] [ORrBacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 4 of 21) : from 600701 
to813890.] [NT: similar to amino acid ABC transporter (permease)] [SP:P96704] [LE:5713] 
[RE : 7 0 8 9 ] [DI : complement ] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



|AI7503000985_1298202J:2_822 



[4602 |[T5351pi | [1779 ||2 . 3e-i83 



Description 

pir: [LN:G69848] [AC:G69848 ] [PN: fructose phosphotransferase system enzyme homolog 
yjdD] [GN:yjdD ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e!183221 : g2633555] 
[LN:BSUB0007] [AC : Z99110 : AL009126] [GN:yjdD] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept -bet 1] [DE:Bacillus subtilis complete genome (section 7 of 21) : from 
1194391to 1411140.] [NT: similar to fructose phosphotransferase system] [LE: 77806] 
[RE : 79575] [DI : direct] 



ORF Name 



NT AA 
NTID AAID — _^ — , , score 
LengthLength 



probability 



|A17B03000y9b 13089052 c3 2246 



] [831 | [4603 ||126 | [41 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT 

AAID 



AA 



LengthLength 



IA17503000985 1350051 c3 2086 



[4604 ||915 | [304 | pT5~ 



score probability 
I B. Ve-12 7 



Description 

gp: [GI:g4574118] [LN : AF009415] [AC : AF009415] [PN:choline transporter] [GN: cudT] 
[OR staphylococcus xylosus] [DB :genpept-bct2] [DE : Staphylococcus xylosus choline 
transporter (cudT) , putativeregulatory protein (cudC) , glycine betaine aldehyde 
dehydrogenase (cudA) , and choline dehydrogenase (cudB) genes, complete cds . ] [NT: CudT] 
[LE:811] [RE: 2433] [DIrdirect] 



252 



ORF Name 



NT 

NT ID AAID , 



AA 



LengthLength 



score 



IA175030009S5 1351687 ci 1515 



[ 833 | [4605 |p5~ | p01 |pr- 



probability 
|3.6e-73 



Description 

gp: [GI :gl644433] [LN: SAU31175] [AC:U31175] [PN: D-specif ic D-2 -hydroxyacid 
dehydrogenase] [GN:ddh] [OR : Staphylococcus aureus] [DB :genpept-bctl] [DE : Staphylococcus 
aureus D-specific D-2-hydroxyacid dehydrogenase (ddh) gene, complete cds . ] [NT:36.7 kDa 
protein; similar to NAD+-linked D-LDH, ] [LE:259] [RE: 1251] [DI: direct] 



ORF Name 



NT 

NT ID AAID _ 



AA 



LengthLength 



score 



IA*7503000$85 1359635 ±3 1248 



|4gog |pi~||i5g |prr 



probability 
2.5e-26 



Description 

pir: [LN:F69870] [AC:F69870 ] [PN:general stress protein homolog ykzA] [GNrykzA ] 
[CL: hypothetical protein yklA] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell81516:g2632036] [LN : BSAJ2571] [AC : AJ002571] [PN : YknA] [GN:yknA] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 168 56 kb DNA fragment 
between xlyA and ykoR.] [NT : homologous to OsmC from Escherichia coli] [LE: 34145] 
[RE:34555] [DI: direct] >gp : [GI:ell83336:g2 633670] [LN : BSUB0007] [AC : Z99110 : AL009126] 
[GN:ykzA] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [NT : alternate gene name: 
yzzE; similar to general] [LE: 187094] [RE: 187504] [DI:direct] 



ORF Name 



NT AA 
NT ID AAID — — _ score 
LengthLength 



probability 



IAI7503000985 1356012 tl 356 



J |835 | [4607 |pT3~||70 1|0.014 



Description 

sp : [LN : PF2R_HUMAN] [AC:P43088] [GN: PTGFR] [OR: HOMO SAPIENS] [SR:, HUMAN] [DE : RECEPTOR) 
(PGF2 ALPHA RECEPTOR) ] [SP:P43088] [DB : swissprot] >pir : [LN : A49973] [AC:A49973 ] 
[PN: prostanoid FP receptor] [GN: PTGFR :FP ] [CL prostaglandin E receptor EP1] [OR: Homo 
sapiens] [SR:, man] [DB:pir2] [MP : lp3 1 . 1 - lp3 1 . 1 ] 



ORF Name 



E 



17503000985 13678300 tl 40 



Description 
NO-HIT 



NT ID 



AAID 



NT AA 
LengthLength 
] |836 | [4608 ||165 | 



score 



probability 



ORF Name 



IAI7503000985 13711588 t2 551 



NT AA 
NT ID AAID — , — , score 
LengthLength 



probability 



] |837 | [4609 |p^~ ||54 [[ST [[0.0019 



Description 

pir : [LN:S53365] [AC:S53365 ] [PN:mucin SAC (clone CEL2 ) ] [GN:MUC5AC ] [OR:Homo sapiens] 
[SR:, man] [DB:pir2] [MP : llpl5 . 5-llpl5 . 5 ] 



ORF Name 



^17503000985 1376926 C3 2008 



NT ID 



838 



„ Tn NT AA 

AAID — — score 

LengthLength 



[4610 |p£5~j pHT" 



probability 
2.9e-4i 



Description 

gp: [GI:g3127079] [LN : AF061070] [AC : AF061070] [PN:PtxC] [GN:ptxC] [OR : Pseudomonas 
stutzeri] [DB : genpept -bct2 ] [DE : Pseudomonas stutzeri Orfll7 (orfll7) , Orf86 (orf86) 
genes, completecds; and ptxABCDE operon, partial sequence.] [NT: putative inner membrane 
component of] [LE:3217] [RE: 4044] [DI: direct] 



253 



ORF Name 



NT AA 
NT ID AAID — T — , , score 
Length Length 



probability 



E 



1 7 503000985 1379687b tii 710 



"j [839 | [4511 |pu~ j|59 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score 



E 



17503000985 13835462 c3 2043 



[4512 ||1449 | [482 ||13fl6 



probability 
II . Oe-141 



Description 

gp: [GI:el299584:g3687418] [LN : BLY17554] [AC:Y17554] [PN : permease] [GNrarcD] 
[OR: Bacillus lichenif ormis] [DB : genpept-bctl] [DE: Bacillus lichenif ormis arcA, arcB, 
arcC and arcD genes.] [LE:2579] [RE: 3985] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID . — . , . — 



LengthLength 



score 



IAI7503000985 13843910 ci 1637 



] [841 | [4513 [pig"" ]|71 |pW 



probability 
3.9e-22 



Description 

gp: [GI :el429613 :g4756156] [LN:A67169] [AC:A67169] [PN:NART GENE] [FN: NITRATE TRANSPORT] 
[OR -.Staphylococcus carnosus] [DB :genpept-pat] [DE: Sequence 9 from Patent EP0805205.] 
[LE:538] [RE:1704] [DI: direct] >gp : [GI :g2529402] [LN: SCU40014] [AC:U40014] [PN:nitrate 
transporter] [GN:narT] [OR : Staphylococcus carnosus] [SR : Staphylococcus carnosus 
strain=TM300] [DB : genpept - be t 2 ] [DE : Staphylococcus carnosus nitrate transporter (narT) 
gene, completecds . ] [NT :NarT] [LE:90] [RE:1256] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



|AI750300098b 13866433 t2 661 



■j |842 | [4514 ||138~||46 | 



Description 
NO-HIT 



ORF Name 



IAI7503000985 13869827 ti 1087 



NTID 



NT 



AA 



AAID , , 

LengthLength 



score 



probability 



j [4615 ||238F]|794 | [2176 ||i . 9e-225 



Description 

pir: [LN:E70041] [AC:E70041 ] [PN:probable copper- transporting ATPase, yvgX] [GN:yvgX ] 
[CL: Bacillus probable copper- transporting ATPase yvgX:ATPase nucleotide-binding domain 

homology :ATPase transduction domain homology : heavy-metal -associated homology] 
[OR:Bacillus subtilis] [EC:3.6.1.-] [DB:pirl] >gp : [GI : ell86038 :g2635863] [LN: BSUB0018] 
[AC:Z99121:AL009126] [GN:yvgX] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 18 of 21) : from 3399551to 3609060.] 
[NT: similar to heavy metal -transporting ATPase] [LE: 40633] [RE: 43044] [DI : complement] 



ORF Name 



U7503000985 13955288 ti 5 



NTID 



Description 
NO-HIT 

ORF Name 



NT AA 
AAID — « — , . score 
LengthLength 

|4616 | [54 | 



probability 



NTID 



NT AA 
AAID — « — , . score 
LengthLength 



probability 



|AI 7 503000985_14113806J:2_b59 

Description 

NO-HIT 



][ 



SIT 



[4517 ||T25 | |41 [ 



254 



ORF Name 



[Al7b0300098b 14222942 t2 747 



Description 
NO-HIT 



ORF Name 



IAI7503000985 1440890 il 362 



Description 
NO-HIT 



NT AA 
NTID AAID — «^ _ — , . score 
LengthLength 

|845 [ |4618 |pT^5 [ |45 | 



probability 



NTID 



NT AA 
AAID — « _ — , . score 
LengthLength 

|4619 ||339 [ |112 | 



probability 



ORF Name 



|AI750300098b 144S4393 c3 2148 



Description 
NO-HIT 



NTID 



^ TT> NT AA 

AAID — , _ — , . score 

LengthLength 



probability 



] |848 | [4620 |[T?T~]|47 | 



ORF Name 



NTID 



NT 



AA 



^ Le^thLe^th 



score 



IAI7503000985 14460882 cl 1400 



] |849 | [4521 |[TT7~||58 ||IgB" 



probability 
7.8e-15 



Description 

gp: [GI:gl022726] [LN : SHU3563 5] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE : Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT :0RF1] 

[LE : 1101] [RE : 1922] [DI : complement] 



ORF Name 



11750300098b 14460882 c3 1975 



NTID 



NT AA 
AAID — , — , , score 
LengthLength 



] |850 | [4622 |[i77~]p \^ZT 



probability 
1.7e-20 



Description 

gp: [GI:gl022726] [LN : SHU3563 5] [AC:U35635] [PNrunknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds.] [NT :0RF1] 

[LE:1101] [RE: 1922] [DI : complement ] 



ORF Name 



NTID 



_ _ NT AA 

AAID — , T — , , score 

LengthLength 



probability 



[AX7503000985 14460882 c3 2098 



Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds.] [NT : ORF1] 

[LE : 1101] [RE : 1922] [DI : complement] 



ORF Name 



NTID 



NT AA 

AAID — ._ — _ score probability 
LengthLength ^ — - — 



&3C7503000585_i4460aa2_c3_21i2 | |852 | [ 4624 ||r7T~| | 58 | p5~|| 8 . 4e-19 



Description 

gp: [GI:gl022726] [LN : SHU3563 5] [AC:U35635] [PNrunknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds.] [NT : ORF1] 

[LE : 1101] [RE : 1922] [DI : complement] 



255 



ORF Name 



NT 



AA 



NTID AAID Le ^ t frLength score probability 



[AI7503000985 14460932 tl 34 



] |853 | [4525 ||177 1 158 | p3~ ||6 . 8e-17 



Description 

gp: [GI :gl022726] [LN: SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 

[DE : Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT : ORF1] 

[LE : 1101] [RE : 1922] [DI : complement] 



ORF Name 



NT 



AA 



NTID AAID , ^ 

LengthLength 



score 



AI7503000985 14460932 £2 467 



] |854 | |4g2g |pT7^|p" 



probability 
1.3e-lS 



Description 

gp: [GI:gl022726] [LN: SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept - be t 1 ] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds.] [NT :ORFl] 

[LE : 1101] [RE : 1922] [DI : complement] 



ORF Name 



NT 

NTID AAID _ — 



AA 



LengthLength 



score 



probability 



LA17503000985 14460932 t3 1045 



] |855 [ [4627 ||177~||Sfl ||218 ||S.9e-18 



Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept -bet 1] 

[DE staphylococcus haemolyticus IS12 72 ORF1 and 0RF2 genes, completecds.] [NT:ORFl] 

[LE : 1101] [RE : 1922] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



IA17503000985 14480312 cl 1349 



j |855 | [4528 ||1308^ |435 |p"ff 



score probability 
3.6e-57 



Description 

pir: [LN:F69581] [AC:F69581 ] [PN:acetoin dehydrogenase E2 component (dihydrolipoamide 
acetyltra) acoC] [GN:acoC ] [CL : dihydrolipoamide acetyltransf erase : 
lipoyl/biotin-binding homology] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell82798:g2633132] [LN : BSUB0005] [AC : Z99108 : AL009126] [PN:acetoin dehydrogenase 
E2 component] [GN:acoC] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 5 of 21): from 802821 tol011250.] [NT : alternate gene name: 
yfjl] [LE:77735] [RE:78931] [DI: direct] >gp : [GI : dl025206 :g2780393] [LN:D78509] 
[AC:D78509] [PN:YfjI] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain :AC327) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis YfjG-YfjR genes, complete cds.] [LE:7394] 
[RE : 8590] [DI : complement] 



ORF Name 


NTID 


_ _ NT AA 
AAID , — . , „ — , , score 
LengthLength 


probability 


AI7503000985J.4490756_ti_295 


857 


4629 rrr^ 


58 




Description 
NO-HIT 










ORF Name 


NTID 


M „ NT AA 
AAID — — score 
LengthLength 


probability 


|AI7503000985_14492125_t3_1310 




| |4 b ,U ||,4X | 


46 | 





Description 
NO-HIT 



256 



NT AA 

ORF Name NTID AAID Le ~ t h Le ^ t h sc ore probability 



AI750300098bJ.449b712_cl_lb93 | |859 | |463i ||29V — [ |99 | piJ^~j|5 . Oe-05 



Description 

gp: [GI:g4894301] [LN: AF065404] [AC : AF065404] [PN:pX01-85] [OR:Bacillus anthracis] 
[DB:genpept-bct2] [DE:Bacillus anthracis virulence plasmid PX01, complete sequence.] 
[LE:99636] [RE:100319] [DIrdirect] 

NT AA 

ORF Name NTID AAID — _„ — ^ score probability 
Length Length c *- 



|&:t7S030009fl5J.4b343a7J:3J.231 | [860 | [ 4632 |p3~| |74 |pi 1 |0. 00017 

Description 

gp: [GI:gl77875l] [LN : SLU73444] [AC:U73444] [PN:SLUSHA] [GN:slushA] [FN:mediates 
haemolytic activity] [OR : Staphylococcus lugdunensis] [DB : genpept-bctl] 
[DE : Staphylococcus lugdunensis SLUSH A (slushA) , SLUSH B (slushB) , andhemolysin 
(slushC) genes, complete cds . ] [LE:570] [RE: 701] [DI: direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

A17b0300098b_14S6221B_c3_2231 | |851 | [ 4633 || 192 ~| [ 53 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — _ score probability 
Length Length 



IAI7S0300098S 14562760 tl 3S9 



T5T" 



46 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



Length Length - 

AI75030009a5_14565637_t2_883 | [863 | | 463b || 132 ~| |43 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength c u 

AI7S030009Bb_14fc26432_c2jL687 | [864 | [ 4636 | |242q |806 | pFT|| 7 . 8e-142 



Description 

Sp : [LN : SECA_STAAU] [AC: 006446] [GNrSECA] [OR : STAPHYLOCOCCUS AUREUS] [DE : PREPROTEIN 
TRANSLOCASE SECA SUBUNIT] [SP:O06446] [DB : swissprot ] >gp : [GI :g2078390] [LN : SAU97062 ] 
[AC:U97062] [PN:SecA] [GN : secA] [FN : secretion] [OR : Staphylococcus aureus] 
[DB: genpept-bctl] [DE : Staphylococcus aureus NCTC 8325 SecA (secA) gene, complete cds.] 
[LE:440] [RE:2971] [DIrdirect] 



NT AA 

ORF Name NTID AAID — _ — , score probability 
LengthLength 



A^7503000985_14g44037_c3_2055 | |855 | [4637 |[I57S"| [524 | p!T^]|2 . 8e-40 



Description 

sp: [LN:YBJT_ECOLI] [AC:P75822] [GN : YBJT] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 53 . 7 KD 
PROTEIN IN ARTP-POXB INTERGENIC REGION] [SP:P75822] [DB : swissprot] 
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ORF Name 



NT AA 
NTID AAID — . T — score 
LengthLength ^ 



probability 



|AI750300098bJ.4(S)b0312_c3_204b | [ 865 | [ 4638 | |11 7 0 | [389 || 319 ||1.2e-28 

Description 

gp: [GI:el423961:g4584121] [LN : BCE7788] [AC : AJ007788] [GN: capA] [OR : Bacillus cereus] 
[DB:genpept-bctl] [DE:Bacillus cereus ilvD, ilvA, capA genes, orf4, orf5 and 
orf 6, partial.] [NT: related sequence M24150] [LE:2729] [RE: 3832] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — — _ score 
LerigthLength 



probability 



IAI7S03000985 14562577 c3 1984 



] F 7 1 |4g39 |p7— 1|178 | 



Description 
NO-HIT 



ORF Name 



NTID 



n7VTr% NT AA 
AAID — _ T — fcU score 
LengthLength - 



probability 



AI7B030009flB_i472037S_c3_2022 | [868 | [ 4540 |p^~| [ 241 || 292 | | S,Be-26 



Description 

pir: [LN:D70380] [AC:D70380 ] [PN : hypothetical protein aq_928] [GN:aq_928 ] [OR:Aquifex 
aeolicus] [DB:pir2] >gp: [GI :g2983456] [LN: AE000714] [AC : AE000714 : AE000657] [PN:putative 
protein] [GN:aq_928] [OR:Aquifex aeolicus] [DB : genpep t - bet 2 ] [DE:Aquifex aeolicus 
section 46 of 109 of the complete genome.] [LE:6398] [RE: 6988] [DI: direct] 



ORF Name 



NTID 



NT AA 

AAID — — . , score probabilit y 
LengthLength c 



IAI7503000985 14725887 t3 1312 



Description 
NO-HIT 

ORF Name 

|A17b0300098b 14729702 c2 1699 



|4641 |60 | 



NTID AAID 



NT 



AA 



— — score probability 
LengthLength c ^ 



J |870 | [4642 |prnT]|676 ||288 ||7.6e-22 



Description 

gp: [GI :g4322670] [LN: AF094508] [AC : AF094508] [PN:dentin phosphoryn] [OR:Homo sapiens] 
[SR: human] [DB : genpept -pri4 ] [DE : Homo sapiens dentin phosphoryn mRNA, complete cds . ] 
[LE:<1] [RE:2367] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — — . , score 
LengthLength 



probability 



AI7503000985 14742887 c2 1735 



Description 
NO-HIT 



j |87i | [4643 ||606 | |20i | 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength probability 



|&I7S03000386jL4742937_cijL445 | |872 | [4644 |[T?uT]|400 || 94 | |0.0033 [ 

Description 

gp: [GI:g2935567] [LN : AF049856] [AC : AF0498 56] [PN:M protein] [GN:emm] [OR : Streptococcus 
pyogenes] [DB :genpept-bct2] [DE : Streptococcus pyogenes strain SS1457 M protein (emm) 
gene, partialcds.] [LE:<1] [RE:>403] [DI:direct] 



ORF Name 



NT AA 

NTID AAID Le ~ thLe ~ t fr score probability 



A1750300098b_14843762_t3_117i 

Description 
NO-HIT 



] I 873 I | 4645 IF^~|| 4g I 
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NT AA 

ORF Name NT ID AAID Le ~ t fr Le ^ t fr sc ore probability 



Al7b0300098b_148blbbl_tl_41V | | 874 | [ 4546 |p3~ || 330 | |mT|| 5 . 6e-116 



Description 

gp: [GI :g2565150] [LN : LLU92 974] [AC : U92974 :M90760 :M90761] [PN:unknown] [OR : Lactococcus 
lactis] [DB:genpept-bctl] [DE : Lactococcus lactis unknown gene, partial cds, and HisC 

(hisC) , unknown, HisG (hisG) , unknown, HisB (hisB) , unknown, HisH (hish),HisA (hisA) , 
HisF (hisF) , HisIE (hisIE) , unknown, unknown, LeuA(leuA) , LeuB (leuB) , LeuC (leuC) , 
LeuD (leuD), unknown, IlvD(ilvD), IlvB (ilvB) , IlvN, IlvC (ilvC) , IlvA (ilvA) , AldB 

(aldB)and aldR (aldR) genes, complete cds.] [NT:0RF14] [LE:10850] [RE:11809] 

[DI : complement] 



NT AA 

ORF Name NT ID AAID — _„ — , _ score prob ability 



LengthLength - 

[AI7g03000985_i485i7i3_c2_ig50 | |875 | 14547 11565 I pi 1 1941 || i.4e-94 



Description 

pir: [LN:D69581] [AC:D69581 ] [PN:acetoin dehydrogenase El component (TPP- dependent 
alpha subuni) acoA] [GN: acoA ] [CL: pyruvate dehydrogenase (lipoamide) alpha chain: 
thiamine pyrophosphate -binding domain homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI :ell82796 :g2633130] [LN : BSUB0005] [AC : Z99108 : AL009126] [PN:acetoin dehydrogenase 
El component] [GN : acoA] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 5 of 21) : from 802821 tol011250.] [NT : alternate gene name: 
yfjK] [LE:75688] [RE:76689] [DI:direct] >gp : [GI :dl025208 :g2780395] [LN:D78509] 
[AC:D78509] [PN: Yf j K] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain :AC327) DNA] 
[DB: genpept-bctl] [DE:Bacillus subtilis YfjG-YfjR genes, complete cds.] [LE:9636] 
[RE:10637] [DI : complement] >gp: [GI :g2957146] [LN : AF006075] [AC : AF006075] 
[PN: TPP -dependent acetoin dehydrogenase, El] [GN : acoA] [OR: Bacillus subtilis] 
[DB:genpept-bct2] [DE: Bacillus subtilis acetoin dehydrogenase enzyme system gene 
cluster, ribosomal protein L6-like protein gene, partial cds, TPP-dependentacetoin 
dehydrogenase, El alpha-subunit (acoA) , TPP-dependentacetoin dehydrogenase, El 
beta-subunit (acoB) , dihydrolipoamideacetyl trans f erase (acoC) and dihydrolipoamide 
dehydrogenase (acoL) genes, complete cds, and regulatory protein (acoR) gene, 
partialcds.] [NT: alpha subunit of the El component of the acetoin] [LE:825] [RE: 1826] 
[DI: direct] >gp : [GI : g2957146] [LN: AF006075] [AC : AF006075] [PN : TPP- dependent acetoin 
dehydrogenase, El] [GN : acoA] [OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis 
acetoin dehydrogenase enzyme system gene cluster , ribosomal protein L6-like protein 
gene, partial cds, TPP-dependentacetoin dehydrogenase, El alpha-subunit (acoA) , 
TPP-dependentacetoin dehydrogenase, El beta-subunit (acoB) , 

dihydrolipoamideacetyl transferase (acoC) and dihydrolipoamide dehydrogenase 
(acoL)genes, complete cds, and regulatory protein (acoR) gene, partialcds.] [NT:alpha 
subunit of the El component of the acetoin] [LE:825] [RE: 1826] [DI: direct] 

NT AA 

ORF Name NTID AAID t — — , score probabil ity 
LengthLength L - 

|AI7b030009flSJL4fl79667_c2_17Si | | 876 | [4548 | 288 | [T?3~]| 9 . 7e-73 ~ | 

Description 

sp: [LN:PANC_BACSU] [AC:P52998] [GN : PANC] [OR:BACILLUS SUBTILIS] [EC:6.3.2.1] 
[DE: (PANTOATE ACTIVATING ENZYME) ] [SP:P52998] [DB : swissprot] >pir : [LN : H69671] 
[AC:H69671 ] [PN : pantothenate synthetase panC] [GN:panC ] [CL :pantoate- -beta-alanine 
ligase] [OR: Bacillus subtilis] [DB:pir2] >gp : [GI :gll46241] [LN : BACYPIA] [AC:L47709] 
[PN: pantothenate synthetase] [GN:panC] [FN pantothenic acid biosynthesis] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [EC:6.3.2.1] [DE: Bacillus subtilis (clone YAC15-6B) ypiABF 
genes, qcrABC genes , ypjABCDEFGHI genes, birA gene, panBCD genes, dinG gene, ypmB 
gene,aspB gene, asnS gene, dnaD gene, nth gene and ypoC gene, completecds 1 s . ] [NT: 4 0.8% 
of identity to the Escherichia coli] [LE: 14128] [RE: 14988] [DI: direct] 
>gp: [GI:ell83687:g2634660] [LN : BSUB0012] [AC : Z99115 : AL009126] [PN : pantothenate 
synthetase] [GN:panC] [FN : pantothenate biosynthesis] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [EC: 6. 3. 2.1] [DE: Bacillus subtilis complete genome (section 12 of 
21): from 2195541to 2409220.] [SP:P52998] [LE:156679] [RE:157539] [DI : complement] 
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NT AA 

ORF Name NT ID AAID Le ~ thIje ^ t h score probability 



A175u300098b_i48a00bl_ti_115 | |877 | |4649 | |TT7T~| | | 24b0 || 1 . 8e-254 



Description 

pir: [LN:C69621] [AC:C69621 ] [PN : f ructose-bisphosphatase , ] [GN:fbp:yydE ] [CL:Bacillus 
subtilis f ructose-bisphosphatase : phosphoesterase core homology] [OR: Bacillus subtilis] 
[EC:3.1.3.11] [DBrpirl] >gp: [GI : dl011939 :gl064791] [LN : BACGNTZA] [AC:D78193] [GN:yydE] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] [DB :genpept-bctl] 
[DE: Bacillus subtilis 36kb sequence between gntZ and trnY genesencoding 34 ORFs . ] 
[LE:9575] [RE:11590] [DI:direct] >gp : [GI : ell84745 : g2636566] [LN : BSUB0021] 
[AC:Z99124 :AL009126] [PN: fructose- 1 , 6 -bisphosphatase] [GNifbp] [FN: gluconeogenesis] 
[OR: Bacillus subtilis] [DB : genpept -bet 1] [EC : 3 . 1 . 3 . 11] [DE: Bacillus subtilis complete 
genome (section 21 of 21): from 3999281to 4214814.] [NT : alternate gene name: yydE] 
[LE:127957] [RE:129972] [DI:direct] 



NT AA 

ORF Name NT ID AAID — — score probability 



LengthLength - 

A17503000385_1488i250J:3_i040 | |878 | [ 4650 |332 | [ 1407 | | 5 . 9e-i44 



Description 

gp: [GI :gl644433] [LN : SAU31175] [AC:U31175] [PN : D-specif ic D-2-hydroxyacid 
dehydrogenase] [GN:ddh] [OR: Staphylococcus aureus] [DB :genpept-bctl] [DE : Staphylococcus 
aureus D-specif ic D-2 -hydroxy acid dehydrogenase (ddh) gene, complete cds.] [NT:36.7 kDa 
protein; similar to NAD+-linked D-LDH, ] [LE:259] [RE: 1251] [DI:direct] 

NT AA 

ORF Name NT ID AAID — , — , score probability 
LengthLength L - 

|AI7503000985_i4881308_cl_1494 | |879 | [4551 ||ZT1~| |7 0 | |7 5 | p~ P 42 | 

Description 

gp: [GI:g4406247] [LN : AF105113] [AC : AF105113] [PN:putative oligosaccharide repeat unit] 
[GN:cpsl9AJ] [OR: Streptococcus pneumoniae] [DB : genpept -bct2 ] [DE : Streptococcus 
pneumoniae type 19A putative oligosaccharide repeatunit transporter (cpsl9AJ) gene, 
partial cds; UDP-N-acetylglucosamine-2 -epimerase (cpsl9AK) , glucose- 1 -phosphate 
thymidylyl trans f erase (cpsl9AL) , dTDP-4-keto-6-deoxyglucose-3 , 5-epimerase (cpsl9AM) , 
dTDP-glucose-4, 6-dehydratase (cpsl9AN) , anddTDP-L-rhamnose synthase (cpsl9AO) genes, 
complete cds; and AliA(aliA) gene, partial cds.] [NT:Cpsl9AJ] [LE:<1] [RE: 818] 
[DI :direct] 



NT AA 

ORF Name NT ID AAID — _ — _ score probability 
Length Length c *- 



|mB0300098Sjl450i67fi_c3_223S | |880 | [4^52 | |888 ~|| 295 | p8 | | 3.2e-58 | 

Description — 
pir: [LN:B69772] [AC:B69772 ] [PN : conserved hypothetical protein ydbO] [GN:ydbO ] 

[CL:conserved hypothetical protein MJ0449] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl020044:gl881264] [LN : AB001488] [AC : AB001488] [GN:ydbO] [OR:Bacillus subtilis] 

[SR:Bacillus subtilis (strain:168) DNA] [DB : genpept -bet 1] [DE:Bacillus subtilis genome 
sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] [NT : FUNCTION UNKNOWN, 
SIMILAR PRODUCT IN B. SUBTILIS] [LE:39929] [RE:40801] [DI: direct] 

>gp: [GI:ell82420:g2632754] [LN: BSUB0003] [AC : Z99106 : AL009126 ] [GN:ydbO] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
3 of 21): from 402751 to611850.] [NT:similar to hypothetical proteins] [LE:103673] 
[RE : 104545] [DI : direct] 
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NT AA 

ORF Name NT ID AAID Le ^ t h Length score probability 



AI76030009BbJL49Jlb01_c2JL790 | pi | [ 4553 | |1062 | |353 | |802 || 7.7e-80 



Description 

pir: [LN:E71373] [AC:E71373 ] [PN:probable regulatory protein (pf oS/R) ] [GN:TP0038 ] 
[OR: Treponema pallidum subsp. pallidum] [SR:, syphilis spirochete] [DB:pir2] 

>gp: [GI:g3322295] [LN : AE001189] [AC : AE001189 : AE00052 0] [PN : regulatory protein (pfoS/R) ] 
[GN:TP0038] [OR : Treponema pallidum] [DB : genpept - be t 2 ] [DE : Treponema pallidum section 5 

of 87 of the complete genome.] [NT: similar to GP: 1354775 percent identity: 100.00;] 
[LE : 1177] [RE : 2229] [DI : complement] 



NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 



A:t7503000985_i49797i3_ciJ.590 | |882 | [ 4S54 || jgQ | p^T~|| l . 9e- "2T 



Description 

sp: [LN:YORZ_LISMO] [AC:P33385] [OR: LISTERIA MONOCYTOGENES] [DE : REGION (ORFZ) ] 
[SP:P33385] [DB : swissprot] >pir : [LN : F43868] [AC:F43868 ] [PN:ORFZ] [OR:Listeria 
monocytogenes] [DB:pir2] >gp : [GI : gl49647] [LN: LISACTLDH] [AC : M82881] [OR:Listeria 
monocytogenes] [SR:Listeria monocytogenes (strain L028) DNA] [DB :genpept-bctl] 
[DE: Listeria monocytogenes lecithinase, lactate dehydrogenase (actA)gene complete cds, 
(plcB) gene complete cds, (ldh) gene completecds . ] [NT: ORFZ] [LE:3582] [RE: 4043] 
[DI : direct] 



ORF Name 



IAI7503000985 15052187 £3 1188 



Description 
NO-HIT 



ORF Name 



LAI7503000985 15052188 £2 597 



Description 
NO-HIT 



ORF Name 



117503000985 15126592 £3 1294 



NTID AAID — — , . score 
LengthLength 



probability 



] |883 | |4655 |[X25 | 



NTID 



NT AA 
AAID — — , , score 
Length Length 



probability 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



][ 



[4557 |pTOT]pgi ||183i ||7.0e-189 



Description 

pir: [LN:S77610] [AC:S77610 ] [PN: probable intercellular adhesion protein C:icaC 
protein] [GN:icaC ] [OR : Staphylococcus epidermidis] [DB:pir2] >gp : [GI :gll61382] 
[LN:SEU43366] [AC:U43366] [PN:IcaC] [GN:icaC] [OR : Staphylococcus epidermidis] 
[DB : genpept -bct2] [DE : Staphylococcus epidermidis operon mediating intercellular 
adhesion: IcaR, IcaA, IcaD, IcaB, and IcaC genes, complete cds.] [LE:3121] [RE:4188] 
[DI : direct] 
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ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



117503000985 1537 ci 1639 



j [886 | [4658 |pT~||50 |p [[0.00073 



] 



Description 

sp: [LN:YDGI_BACSU] [AC:P96707] [GNrYDGI] [OR:BACILLUS SUBTILIS] [EC : 1 . -.-.-] 
[DE : PUTATIVE NAD ( P ) H NITROREDUCTASE YDGI , ] [SP:P96707] [DB : swissprot] >pir : [LN : C69783 ] 
[AC:C69783 ] [ PN : NADH dehydrogenase homolog ydgl] [GNiydgl ] [CL : nitroreductase] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl020152 : gl881372] [LN: AB001488] [AC : AB001488] 
[GNrydgl] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis genome sequence, 148 kb sequence of the 
regionbetween 35 and 47 degree.] [NT: SIMILAR TO NITROREDUCTASE.] [LE: 145410] 
[RE:146039] [DI : complement ] >gp : [GI : el 18 2 54 5 :g2 632879] [LN : BSUB0004] 

[AC:Z99107 :AL009126] [GN:ydgI] [FN:unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] 

[DE:Bacillus subtilis complete genome (section 4 of 21): from 600701 to813890.] 

[NT: similar to NADH dehydrogenase] [SP:P96707] [LE: 11206] [RE: 11835] [DI : complement] 



ORF Name 



NTID 



IAI7503000985 155387 ci 1567 



■SWT 



NT AA 

AAID — 

Length Length 

[4559 |[rT8~| [45 | 



score 



probability 



Description 
NO-HIT 



ORF Name 



LAI7503000985 15671925 c3 2051 



Description 
NO-HIT 



ORF Name 



IAI7503000985 157093 c2 1947 



Description 
NO-HIT 



ORF Name 



[AI7503000985 15711457 13 919 



Description 
NO-HIT 



ORF Name 



VI7503000985 157513 c3 2158 



Description 
NO-HIT 



ORF Name 



IAI7503000985 15752187 12 564 



Description 
NO-HIT 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



578 1 14660 11138 I [T5" 



NT AA 
NTID AAID „ — , — , score 
Length Length 



probability 



J |889 | [4661 |prr-||48 | 



NT AA 

NTID AAID — _ — , , score 

Length Length 



probability 



|890 | [4652 IpOI | [56 | 



NT AA 

NTID AAID — . — , , score 

LengthLength 



probability 



] I 891 I | 4gg3 IP^H | 51 | 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



[892 | [4664 ||i62"| [5T 
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ORF Name 



NTID 



NT AA 
AAID — , — , , score 
LengthLength 



probability 



1750300098b lbVBOV el 1513 



] |893 [ [4665 | p5¥]|527 | pgTT| |l . 8e-183 



Description 

gp: [GI:e316580:g2791907] [LN : SSK11MECA] [AC:Y13094] [GN : CTORF585] [OR : Staphylococcus 
sciuri] [DB:genpept-bctl] [DE:S.sciuri mecA gene, strain Kll (792).] [LE:<1] [RE:1757] 
[DI : complement] 



ORF Name 



NT AA 

NTID AAID — — _ score pro bability 

LengthLength • L - 



|A:£7503000985 15814001 ti 269 



Description 
NO-HIT 

ORF Name 



|894 | [4555 |pT~j|50 | 



NTID 



„ TO NT AA 
AAID — — , score 
LengthLength 



IAI7503000985 158411 c3 2104 



probability 
] I 895 | [4557 ||gg5~||330 ||870 ||4.8e-87 



Description 

sp: [LN:LDH_BACSU] [AC:P13714] [GN : LDH : LCTE] [OR : BACILLUS SUBTILIS] [EC : 1 . 1 . 1 . 2 7] 
[DE : L- LACTATE DEHYDROGENASE,] [SP:P13714] [DB : swissprot] 



ORF Name 



NT AA 
NTID AAID — _ — . , score 
LengthLength 



probability 



IA17503000985 15894527 13 1275 



Description 
NO-HIT 



J |896 | |4668 ||ZF7 [ |88 | 



ORF Name 



NTID 



_ _ _ NT AA 

AAID — , » _ — , . score 

LengthLength 



probability 



^17503000985 15914762 tl 251 



j |897 | [4669 |[57g— 1|192 ||654 ||3.7e-64 



Description 

gp: [GI:g4574119] [LN : AF009415] [AC : AF009415] [PN:putative regulatory protein] [GN:cudC] 
[OR: Staphylococcus xylosus] [DB : genpept -bct2 ] [DE : Staphylococcus xylosus choline 
transporter (cudT) , putativeregulatory protein (cudC) , glycine betaine aldehyde 
dehydrogenase (cudA) , and choline dehydrogenase (cudB) genes, complete cds . ] [NT: CudC] 
[LE : 2604] [RE : 3164] [DI : complement] 



ORF Name 



NTID 



„ T „ NT AA 

AAID — « — . . score 

LengthLength 



probability 



IAI7503000985 16048828 c3 2224 



Description 
NO-HIT 



2 |898 | [4670 ||261 | |B6 | 



ORF Name 



NTID 



^ ^ NT AA 

AAID — — . score 

LengthLength 



probability 



^17503000985 164715 c3 2251 



] |899 | [4671 [pT-[|316 | pfg~||3 . 8e-78" 



Description 

gp: [GI:g4433641] [LN : AF029224] [AC : AF029224 : AF029225] [PN:SirB] [GNrsirB] 
[OR staphylococcus carnosus] [DB : genpept -bet 2] [DE : Staphylococcus carnosus nir and nar 

operons, complete sequences.] [NT: similar to C-terminus of Esherichia coli CysG.] 
[LE:5079] [RE:6032] [DI: direct] 



ORF Name 



AI7503 000985_16600062_tl_117 

Description 

NO-HIT 



NT AA 
NTID AAID — _ T — , x score 
Length Length 



probability 



|900 | [4672 ||132 | 
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ORF Name 



NT AA 
NT ID AAID — _ — . . score 
LengthLength 



probability 



F 



17503000985 16699067 el 1358 



] [901 | [4673 [py— 1|198 |pu~||i.2e-44 



Description 

sp: [LN:HIS7_LACLA] [AC:Q02134] [GN : HISB] [OR : LACTOCOCCUS LACTIS] 

[SR : , SUBSPLACTIS : STREPTOCOCCUS LACTIS] [EC : 4 . 2 . 1 . 19] [DE : IMIDAZOLEGLYCEROL- PHOSPHATE 
DEHYDRATASE, (IGPD)] [SP:Q02134] [DB : swissprot] >pir: [LN:G45734] [AC : G45734 : C368 90 ] 
[PN : HisB] [CL : imidazoleglycerol -phosphate dehydratase : imidazoleglycerol -phosphate 
dehydratase homology] [OR: Lactococcus lactis subsp. lactis] [DB:pir2] >gp : [GI : g2565143] 
[LN:LLU92974] [AC :U92974 :M90760 :M90761] [PN:HisB] [GN : hisB] [OR : Lactococcus lactis] 
[DB : genpept-bctl] [DE : Lactococcus lactis unknown gene, partial cds, and HisC 
( hi sC ), unknown, HisG (hisG) , unknown, HisB (hisB) , unknown, HisH (hish) ,HisA (hisA) , 
HisF (hisF) , HisIE (hisIE) , unknown, unknown, LeuA(leuA), LeuB (leuB) , LeuC (leuC) , 
LeuD (leuD) , unknown, IlvD(ilvD), IlvB (ilvB) , IlvN, IlvC (ilvC) , IlvA (ilvA) , AldB 
(aldB)and aldR (aldR) genes, complete cds . ] [NT : dehydratase] [LE:5869] [RE:6471] 
[DI : direct] 



ORF Name 



NT 



AA 



NT ID AAID — _ „ — , . score probabili ty 
LengthLength * c J - 



IAI7503000985 15798125 c2 1852 



[902 | [4574 [pi"~ ]|145 ||154 |[2.5e-10 



Description 

sp: [LN:GUDT_BACSU] [AC:P42237] [GN : YCBE] [OR: BACILLUS SUBTILIS] [DE : PROBABLE GLUCARATE 
TRANSPORTER] [SP:P42237] [DB : swissprot] >pir : [LN: H69752] [AC:H69752 ] [PN : probalble 
glucarate transporter] [GN : ycbE ] [CL : hexuronate transporter] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl007040 : g709999] [ LN : BACYCB2 0 ] [AC:D30808] [PN : glucarate 
dehydratase] [GN:ycbE] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : 16 8TrpC2 ) 
DNA] [DB: genpept-bctl] [DE:Bacillus subtilis DNA around 20 degrees region of 
chromosomecontaining yckA-T genes.] [LE:3924] [RE: 5291] [DI: direct] 

>gp: [GI:ell82200:g2632534] [LN : BSUB0002 ] [AC : Z99105 : AL009126] [GN:ycbE] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
2 of 21): from 194651 to415810.] [NT:similar to glucarate transporter] [SP:P42237] 
[LE:75738] [RE:77105] [DI:direct] 



ORF Name 



LAI7503000985 15802312 c3 2228 



Description 
NO-HIT 



NT ID 



][ 



„ TT% NT AA 

AAID — , — , score 

LengthLength 



probability 



[4575 ||138 | p"5" 



ORF Name 



IAI7503000985 15834377 c2 1851 



Description 
NO-HIT 



NT ID 



][ 



AAID . — . 



NT AA 
LengthLength 
] |4676 ||TT7 [ [4 8 | 



score probability 
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NT AA 

QRF Name EEIR MID Le ^thLe^th SCQre P^ability 



AI7B030009BbJ.^JB207J:l_122 | [905 | [4677 ||288 | [95 || 175 | p.le-13 



Description 

sp: [LN:YJDJ_ECOLI] [AC:P39274] [GN : YJDJ] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 10.5 KD 
PROTEIN IN DCUB-LYSU INTERGENIC REGION (O90A) ] [SP:P39274] [DB : swissprot] 
>pir : [LN:S56356] [AC : S56356 : F65222 ] [PN : hypothetical 10 . 5K protein (dcub-lysu 
intergenic region) : hypothetical protein o90a] [GN:yjdJ ] [OR : Escherichia coli] 
[DB:pir2] >gp: [GI:g536972] [LN:ECOUW93] [AC:U14003] [OR : Escherichia coli] 
[DB:genpept-bctl] [DE : Escherichia coli K-12 chromosomal region from 92.8 to 00.1 
minutes.] [NT : ORF_o90a] [LE:42913] [RE:43185] [DI:direct] >gp : [GI : gl790569] 
[LN:AE000485] [AC : AE000485 :U00096 ] [PN:orf, hypothetical protein] [GNryjdJ] [FN:orf; 
Unknown] [OR : Escherichia coli] [DB :genpept-bct2] [DE : Escherichia coli K-12 MG1655 
section 375 of 400 of the completegenome . ] [NT:o90a; 100 pet identical amino acid 
sequence and] [LE:6496] [RE: 6768] [DI:direct] 



NT AA 

QRF Name NT ID AAID — _ — _ score probability 
LengthLength -L 

p7503000985_i70i0952_c3_2i47 | [90S | [ 4578 ||gg3~| |267 |p9 \ \lAe-62 "| 

Description 

pir: [LN:G70080] [AC:G70080 ] [PN : conserved hypothetical protein yxkD] [GN:yxkD ] 
[CL: conserved hypothetical protein yitT] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell86383:g2636419] [LN : BSUB0020] [AC : Z99123 : AL009126] [GNryxkD] [FN:unknown] 
[OR: Bacillus subtilis] [DB : g enpep t -bet 1 ] [DE: Bacillus subtilis complete genome (section 
20 of 21): from 3798401to 4010550.] [NTrsimilar to hypothetical proteins] [LE:188319] 
[RE:189155] [DI : complement] >gp : [GI :dl012387 :gl783243] [LN:D83026] [AC : D83026 :D45911] 
[GN : yxkD] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:BGSC 1A1) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis genome sequence covering lic-cel region.] 
[NT homologous to jojC gene product (B. subtilis;] [LE:35310] [RE:36146] [DIrdirect] 



NT AA 

QRF Name NTID AAID v — . — , score probability 
Length Length *~ 



AI750300098SJ.79010_c3_:*136 | |907 | [4679 |p^~ 1 1 181 | p3~ || 2 . 6e-31 " 



Description 

Sp : [LN: 0GT_HAEIN] [AC:P44687] [GN: OGT : DAT1 : HI 04 02] [OR : HAEMOPHILUS INFLUENZAE] 
[EC:2. 1.1.63] [DE : ALKYLTRANSFERASE) ] [SP:P44687] [DB : swissprot] >pir: [LN:G64065] 
[AC:G64065 ] [PN : methylated- DNA- -protein-cysteine S-methyltransf erase homolog ] 
[CL : methylated-DNA- -protein- cysteine 

S-methyltransf erase :methylated-DNA- -protein-cysteine S-methyltransf erase homology] 
[OR Haemophilus influenzae] [DB:pir2] >gp : [GI :gl573373] [LN:U32723] [AC :U3 2723 :L4 2023] 
[PN: methylated-DNA- -protein- cysteine] [GN:HI0402] [OR : Haemophilus influenzae Rd] 
[DB:genpept-bct2] [DE : Haemophilus influenzae Rd section 38 of 163 of the complete 

genome.] [NT:similar to SP:P11742 GB:X15659 PID:39876] [LE:9611] [RE:10183] [DI:direct] 

NT AA 

QRF Name NTID AAID — , — , score probability 
Length Length c 



&:HB0300098S_i8ifl8S_c3_2i6i | |908 | |4£80 ||T?T~[ p - 



Description 
NO-HIT 



265 



NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 
|A17503000985_19400i_rl_134 | | 909 | [ 4681 || 165 | [52 ||5 S | |0 , 025 "] 

Description 

sp : [LN : RK19_GUITH] [AC:O78409] [GN:RPL19] [OR : GUILLARDIA THETA] [SR : , CRYPTOMONAS PHI] 
[DE : CHLOROPLAST SOS RIBOSOMAL PROTEIN L19] [SP:O78409] [DB : swissprot ] >gp : [GI : g36 02933 ] 
[LN:AF041468] [AC:AF041468 : X14171 : X62349 : X51511 : X14504 : X52158 : X52 912 : X56806 : M7654 7] 
[PN:ribosomal protein L19] [GN:rpll9] [OR : Chloroplast Guillardia theta] [SR : Guillardia 
theta] [DB: genpept -pln2] [DE : Guillardia theta complete plastid genome J [LE:181] 
[RE: 573] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ „ — , , score proba bility 
Length Length J - 

A17503000985_i953143g_c3_20g9 | |910 | [ 4682 |p^5~|| 454 | pu~|| 4 . 7e-05 



Description 

gp: [GI:gl813493] [LN : BFU643 14] [AC:U64314] [PN : hydrophobic protein] [ORrBacillus 
firmus] [DB : genpept -bet 1 ] [DE:Bacillus firmus putative hydrophobic protein gene, 
partial cds . ] [NT: similar to Bacillus subtilis putative protein] [LE:193] [RE:>795] 
[DI : direct] 

NT AA 

ORF Name NTID AAID — _ T — , , score probability 
LengthLength c 

AI7S03000985_195g2805_c3_213b | [911 | [4583 [ 70 |p || 0.036 



Description 

sp: [LN:VNB_INBMF] [AC:P16200] [GN:NB] [OR : INFLUENZA B VIRUS] [SR : B/MEMPHIS/3/89 , ] 
[DE:NB GLYCOPROTEIN] [SP:P16200] [DB : swissprot] >pir : [LN : A36825] [AC:A36825 ] [PN:NB 
glycoprotein] [CL : influenza B virus NB glycoprotein] [OR : influenza B virus] [DB:pirl] 
[MP:segment 6 ] >gp : [GI :g325221] [LN: FLBNAE] [AC:M30635] [OR : Influenza B virus] 
[SR: Influenza B/Memphis/3/89 , cDNA to viral RNA] [DB : genpept -vrl] [DE : Influenza 
B/Memphis/3/89, neuraminidase and NB (seg 6) RNA, complete cds.] [NT :NB protein] [LE:16] 
[RE: 315] [DI: direct] 



NT AA 

ORF Name NTID AAID — _ — ^ score probability 
LengthLength *- 



IAI7503000985 19564702 c3 2073 1 \5T2 



^ rj 



TFS 11397 ||6.3e-37 



Description 

sp: [LN:GSHZ_NICSY] [AC:P30708] [OR : NI COT I ANA SYLVESTRIS] [SR:,WOOD TOBACCO] 
[EC:1.11.1.9] [DE: GLUTATHIONE PEROXIDASE HOMOLOG 6P229,] [SP:P30708] [DB : swissprot] 
>pir: [LN:S20501] [AC:S20501 ] [PN : glutathione peroxidase homolog] [CL : glutathione 
peroxidase] [OR : Nicotiana sylvestris] [SR:, wood tobacco] [DB:pir2] >gp : [GI :gl9739] 
[LN:NS6P229] [AC:X60219] [OR : Nicotiana sylvestris] [SR:wood tobacco] [DB : genpept -plnl] 
[DE:N. sylvestris mRNA for 6P229 polypeptide homologous to animalglutathione 
peroxidases.] [NT homologous to animal glutathione peroxidases] [SP:P30708] [LE:154] 
[RE: 663] [DI:direct] 

NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength u 



IAI7503000985 19585877 ci 1478 



1 14685 111212 I [TOT 



3.1e-83 



Description 

pir: [LN:H69817] [AC:H69817 ] [PN : aminoacylase homolog yhaA] [GN:yhaA ] [CL:hippurate 
hydrolase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83009 :g2633343] [LN: BSUB0006] 
[AC:Z99109:AL009126] [GN:yhaA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to aminoacylase] [LE: 80123] [RE: 81313] [DI : complement] 
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ORF Name 



NT AA 
NTID AAID — _ T — , , score 
LengthLength 



probability 



11750300098b 196940 50 t3 1299 



"J [914 | [4585 ||147 | [48 | 



Description 
NO-HIT 



ORF Name 



NTID 



„ Tn NT AA 

AAID — « — . . score 

LengthLength 



probability 



7503000985 197090 c2 177S 



5T5" 



Description 
NO-HIT 



ORF Name 



NTID 



AI750300098b 19709637 Cl 1576 



9T5~ 



MT „ NT AA 

AAID — , — , . score 

LengthLength 

[4588 ||TT7 [ |48 | 



probability 



Description 
NO-HIT 

ORF Name 



IAI7503000985 19720542 c2 1710 



NTID 



NT 

AAID . 



AA 



LengthLength 



[4589 |[TTg%-| pT 



score probability 
| pr7e-50 



Description 

gp: [GI :el3 58508 :g3 980137] [LN: LM034616] [AC : AJ007319] [PN : succinyl -diaminopimelate 
desuccinylase] [GNrdapE] [OR: Listeria monocytogenes] [DB :genpept-bctl] [DE: Listeria 
monocytogenes ascB, inlG, inlH, inlE, dapE genes.] [LE:5533] [RE: 6672] [DIrdirect] 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



^17503000985 197312 c2 1804 



][ 



[4590 |pT3~[|290 |[5TT 



score probability 
5.2e-81 



Description 

sp: [LN:YHDF_BACSU] [AC:O07575] [GN: YHDF] [OR:BACILLUS SUBTILIS] [EC:1. -.-.-] [DE:(EC 
1. -.---)] [SP:O07575] [DB : swissprot ] >pir : [LN : D69825] [AC:D69825 ] [PN:glucose 
1-dehydrogenase homolog yhdF] [GN:yhdF ] [CL : short -chain alcohol dehydrogenase 
homology] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell82946 : g2633280] [LN : BSUB0006] 
[AC:Z99109:AL009126] [GN:yhdF] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT:similar to glucose 1-dehydrogenase] [SP:O07575] [LE:22211] [RE:23080] [DI:direct] 
>gp: [GI:ell91878:g2226201] [LN :BSY14082] [AC:Y14082] [PN : hypothetical protein] 
[GN:yhdF] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis chromosomal 
DNA, region 72 to 75 degrees: spoVRto sspB.] [NT : Similarity to glucose and ribitol 
dehydrogenase] [SP: 007575] [LE:8859] [RE: 9728] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



AI7503000985 19781305 tl 20 



Description 
NO-HIT 



[4691 ||T21 | pT 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



E 



17503000983 19787788 t3 1311 



] |920 | 



4592 | |155 | [51 | 



Description 
NO-HIT 
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ORF Name 



1AI7503000985 19929586 c3 2180 



Description 
NO-HIT 



NT AA 
NTID AAID T - , T ~ , score 
Length-Length 



probability 



] P 21 1 |4693 ||198 |p" 



ORF Name 



IAI7503000985 20081538 £1 158 



Description 
NO-HIT 



NTID 



WIT 



NT AA 
AAID — , » _ — . score 
Length Length 

[4554 |p^~] |40 | 



probability 



ORF Name 



NTID 



7V7vtt ^ NT AA 

AAID — « — 1-1 score 

LengthLength 



1A17503000985 20167186 c2 1958 



[923 | [4695 ||T?g-~| [51 [p^T 



probability 
|4.0e-19 



Description 

gp: [GI:g4096799] [LN: SCU40158] [AC:U40158] [OR : Staphylococcus carnosus] 
[DB :genpept-bct2] [DE : Staphylococcus carnosus response regulator- like protein 
(orfx)gene, partial cds . ] [NT:orfx; function unknown; similar to response] [LE:<1] 
[RE: 560] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ , — _ score 
LengthLength 



IA17503000985 20317 c2 1727 



][ 



[4696 |[2088 



probability 
8.4e-38 



Description 

sp: [LN:TAGF_BACSU] [AC:P13485] [GN : TAGF : RODC : TAG 3 ] [OR : BACILLUS SUBTILIS] [DE:TEICHOIC 
ACID BIOSYNTHESIS PROTEIN F] [SP:P13485] [DB : swissprot] >pir : [LN : S06049] 
[AC:S06049:G69720 ] [PNrprobable CDPglycerol glycerophosphotransf erase, 
:CDP-glycerol : polyglycerol phosphate glycero-phosphotransf erase tagF:rodC 
protein: teichoic-acid synthase] [GN : rodC : tag- 3 : tagF ] [OR: Bacillus subtilis] 
[EC:2.7.8.12] [DB:pir2] [MP:310 degrees ] >gp : [GI : g40100] [LN:BSRODC] [AC:X15200] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis rodC operon.] [NT:rodC 
(tag3) polypeptide (AA 1-746)] [SP:P13485] [LE:2178] [RE:4418] [DIrdirect] 
>gp: [GI :ell84478:g2 636098] [LN : BSUB0019] [AC : Z99122 : AL009126] 

[PN:CDP-glycerol .-polyglycerol phosphate] [GN:tagF] [FN:teichoic acid biosynthesis] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodC] [SP:P13485] 
[LE:78129] [RE:80369] [DI : complement] >gp: [GI : ell 844 78 :g2636098] [LN:BSUB0019] 
[AC:Z99122 :AL009126] [PN : CDP-glycerol : polyglycerol phosphate] [GN: tagF] [FN: teichoic 
acid biosynthesis] [OR: Bacillus subtilis] [DB:genpept] [DE: Bacillus subtilis complete 
genome (section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodC] 
[SP:P13485] [LE:78129] [RE:80369] [D I : complement] 



ORF Name 



lA!t7503000985 20323403 r3 1320 



Description 
NO-HIT 



NT 



AA 



NTID AAID — — , , score probability 
- — ~~ LengthLength c *- 



1 P 25 1 | 4697 H 141 IP 5 " 



ORF Name 



NTID 



„ T „ NT AA 

AAID — — , , score 

LengthLength 



IAI7503000985 20335260 c3 2081 



] P 26 | [4698 ||188r||626 [p3T 



probability 
8.Se-223 



Description 

gp: [GI:g4098081] [LN: LLU73336] [AC:U73336] [PN : anaerobic ribonucleotide reductase] 
[GN:nrdD] [OR : Lactococcus lactis] [DB :genpept-bct2] [DE : Lactococcus lactis anaerobic 

ribonucleotide reductase (nrdD) andanaerobic ribonucleotide reductase activator protein 
(nrdG) genes, complete cds.] [NT : NrdD] [LE:167] [RE:2410] [DI:direct] 



268 



ORF Name 



NT ID 



AAID 



NT 



AA 



Length Length 



score probability 



17b0300098b 2034>OUBb tl 1003 



j |927 | [4599 |pgT]|560 | pug— ||7 . 3e-91 



Description 

sp: [LN:YHXB_BACSU] [AC:P18159] [GN : YHXB] [OR : BACILLUS SUBTILIS] [EC:5.4.2.8] 
[DE : PROBABLE PHOSPHOMANNOMUTASE, (PMM) ] [SP:P18159] [DB : swissprot] >pir : [LN: C69835] 
[AC:C69835:D45868 :S18566 ] [PN: phosphomannomutase homolog yhxB : hypothetical protein 
(glpD 3' region)] [GN : yhxB ] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell82920:g2633254] [LN : BSUB0005] [AC : Z99108 : AL009126] [GN : yhxB] [FNiunknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
5 of 21): from 802821 tol011250.] [NTrsimilar to phosphomannomutase] [SP:P18159] 
[LE:203459] [RE:205156] [DI: direct] >gp : [GI : ell82932 : g2633266] [LN : BSUB0006] 
[AC:Z99109:AL009126] [GN : yhxB] [FN:unknown] [ORiBacillus subtilis] [DB : genpept - bet 1 ] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to phosphomannomutase] [SP:P18159] [LE:6779] [RE: 8476] [DI: direct] 
>gp: [GI:e324943:g2226139] [LN : BSY14079] [AC:Y14079] [PN : hypothetical protein] [GN : yhxB] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis chromosomal DNA, region 
75 degrees: glpPFKDoperon and downstream.] [NT:see EMBL M34393 and Swiss Prot P18159.; 
This could] [SP:P18159] [LE:6427] [RE: 8124] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
Length Length 



probability 



IAI7503000985 20410307 £2 584 



J |928 | [4700 |pr?g— ||4i | 



Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID — _ — score 
Length Length 



probability 



IAI7503000983 20413202 fl 47 



] |929 | [4701 ||TT52-||383 | p^~]|l . le-96 



Description 

Sp: [LN: YHAD_ECOLI] [AC:P23524] [GN : YHAD] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 39.1 KD 
PROTEIN IN RNPB-SOHA INTERGENIC REGION (ORF 3)] [SP:P23524] [DB : swissprot] 



ORF Name 



NTID 



AA 



NT 

AAID — _ „ — ^_ scor e probability 
LengthLength c <L 



1AI7503000985 2043313b c3 2142 



"| |930 | [4702 |[gPI~||26fl | 



1.4e-10 



Description 

gp: [GI :dl011987:gl402529] [LN:D78257] [AC:D78257] [PN:ORF8] [GN:Orf8] [OR : Enterococcus 
faecalis] [SR: Enterococcus faecalis plasmid:pYI17 DNA] [DB : genpept -bctl] 
[DE: Enterococcus faecalis plasmid pYI17 genes for BacA, BacB, ORF3 / ORF4 / ORF5, ORF6, 
ORF7, ORF8, ORF9, ORF10, ORF11 , partial cds . ] [LE:1899] [RE: 2261] [DI : complement] 



ORF Name 



IAI7503000985 20484386 ±2 470 



Description 
NO-HIT 



NTID 



„™ NT AA 

AAID — — , score 

LengthLength 



probability 



931 | [4703 |p5~ [|42 | 



ORF Name 



NTID AAID 



NT 



LengthLength 



AA 

— ^ score 



AI7503000985_2 050762 5_Ci_2 004 | |932 | |4704 | pTTmT^ [ | 



probability 
|1.0e-20 



Description 

gp: [GI:g3127078] [LN: AF061070] [AC : AF061070] [PN:PtxB] [GN:ptxB] [OR : Pseudomonas 
stutzeri] [DB : genpept -bet 2] [DE : Pseudomonas stutzeri Orfll7 (orfll7) , Orf86 (orf86) 
genes, completecds; and ptxABCDE operon, partial sequence.] [NT:putative binding 
protein component of] [LE:2345] [RE: 3208] [DI:direct] 
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ORF Name 



NT ID AAID „ _ „ _ 



score 



933 



NT AA 
LenqthLength 
[4705 IfJTT 



probability 
8 . 2e-28 



|AI7503000985_20b096J7_c3_2110 

Description 

gp: [GI:g3676414] [LN : AF051917] [AC : AF051917 : L19570] [PN:unknown] [OR : Staphylococcus 
aureus] [DB : genpept -bet 2 ] [DE : Staphylococcus aureus plasmid pSK41, complete sequence.] 
[NT:Orf423] [LE:755] [RE:2026] [DI: direct] 



ORF Name 



NTID 



NT 

AAID — 



AA 



Length Length 



score 



AI7503000985 20509637 c3 2186 



][ 



[4705 ||1545 | pT" 



TST5" 



probability 
3.1e-170 



Description 

sp: [LN:GNTK_BACLI] [AC:P46834] [GNiGNTK] [OR:BACILLUS LICHENIFORMIS] [EC : 2 . 7 . 1 . 12] 
[DE : GLUCONOKINASE , (GLUCONATE KINASE)] [SP:P46834] [DB : swissprot] >pir : [LN : JC2304] 
[AC:JC2304 ] [PN : gluconate kinase, :gntK protein] [GN:gntK ] [CL :xylulokinase] 
[ORrBacillus lichenif ormis] [EC:2.7.-.-] [DB:pir2] >gp : [GI : dl007073 : g563951] 
[LN : BACGNTBL] [AC:D31631] [ PN : gluconate kinase] [GNrgntK] [OR:Bacillus lichenif ormis] 
[SR:Bacillus licheniformis (strain :BGSC5A2) DNA] [DB : genpept -bet 1] [DErBacillus 
lichenif ormis DNA for hypothetical protein and Gntproteins . ] [LE:1725] [RE: 3266] 
[DI: direct] 



ORF Name 



^I75030009tfb 20515643 t3 1130 



NTID 



_ _ _ NT AA 

AAID — — score 

Length Length 



probability 



Description 
NO-HIT 



j |935 | [4707 ||141 1 |46 | 



ORF Name 



NTID 



NT 



AA 



AAID v 

LengthLength 



score 



IAI7503000985 20524067 ci 1622 



] |936 | |4708 |p^~||212 |pi~ 



probability 
2 . 5e-19 



Description 

pir: [LN:C70041] [AC:C70041 ] [PN : conserved hypothetical protein yvgV] [GN:yvgV ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell86036 :g2635861] [LN : BSUB0018] 
[AC:Z99121:AL009126] [GN:yvgV] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:Bacillus subtilis complete genome (section 18 of 21): from 3399551to 3609060.] 
[NT: similar to hypothetical proteins] [LE: 37577] [RE: 38245] [DI : complement] 
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NT AA 

° RF Name NTID AAID Le ~ thLe — th score probability 



A1750300098b_aOt>B9bfea_cl_1392 | |937 | [ 4 7 09 ||15B 7 j pT3 [ p^~[| i . le-21 — | 



Description 

Sp : [LN : TAGE_BACSU] [AC:P13484] [GN: TAGE : RODD : GTAA] [OR : BACILLUS SUBTILIS] [EC : 2 . 4 . 1 . 52] 
[DE:(EC 2.4.1.52) (TEICHOIC ACID BIOSYNTHESIS PROTEIN E) ] [SP:P13484] [DB : swissprot] 
>pir: [LN:S06048] [AC : S06048 : F69720 ] [PN: poly (glycerol -phosphate) 
alpha-glucosyltransf erase, tagE : probable rodD protein :UDP-glucose- -polyglycerol 
phosphate glucosyltransf erase tagE] [GN : tagE : rodD ] [OR: Bacillus subtilis] [EC: 
2.4.1.52] [DB:pir2] [MP:310 degrees ] >gp : [GI :g580920] [LN: BSRODC] [AC:X15200] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis rodC operon.] [NT : rodD 
(gtaA) polypeptide (AA 1-673)] [SP:P13484] [LE:157] [RE:2178] [DI:direct] 
>gp: [GI:ell84479:g2636099] [LN: BSUB0019] [AC : Z99122 : AL009126] 

[PN:UDP-glucose: polyglycerol phosphate] [GN:tagE] [FN:teichoic acid biosynthesis] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 4 . 1 . 52] [DE:Bacillus subtilis complete 

genome (section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodD, 

gtaA, gtaD] [SP:P13484] [LE: 80369] [RE: 82390] [DI : complement] 

>gp: [GI:ell84479:g2636099] [LN: BSUB0019] [AC : Z99122 : AL009126] 

[PN:UDP-glucose:polyglycerol phosphate] [GN:tagE] [FN:teichoic acid biosynthesis] 
[OR: Bacillus subtilis] [DB:genpept] [EC : 2 . 4 . 1 . 52] [DE: Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodD, gtaA, gtaD] 
[SP:P13484] [LE: 80369] [RE: 82390] [DI : complement] 

NT AA 

ORF Name NT ID AAID — , — score probability 

LengthLength <c d - 



AI7503000985_2058937J:3_1191 | [938 | [4710 |p3Tg~|| 512 j | 1041 || 3 . 6e-105 



Description 

pir: [LN:C69676] [AC : C69676 : B3 9096 : S16904 : 13 9952 : S18269 ] [PN: alkaline phosphatase, III 
precursor : alkaline phosphatase B (phoB) ] [GN : phoB : phoAI I I ] [CL:alkaline phosphatase] 
[OR:Bacillus subtilis] [EC:3. 1.3.1] [DB:pir2] >gp : [GI : ell82553 : g2632887] [LN:BSUB0004] 
[AC:Z99107:AL009126] [PN:alkaline phosphatase III] [GN: phoB] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC:3. 1.3.1] [DE:Bacillus subtilis complete genome (section 4 of 21): 
from 600701 to813890.] [NT : alternate gene name: phoAIII] [SP:P19405] [LE:19113] 
[RE:20501] [DI : complement] >gp: [GI :dl020477 :gl945090] [LN:D88802] [AC:D88802] [GN :phoB] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain: 168, 
isolate: JH642] [DB : genpept-bctl] [DE:Bacillus subtilis DNA for phoB-rrnE-groESL region, 
complete cds.] [NT : B . subtilis alkaline phosphatase IIIA; P19405] [LE:6115] [RE:7503] 
[DI : complement] 
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NT AA 

ORF Name NT ID AAID LengthLength score probability 



AI750300098b_20822287_cl_1574 | | 939 | [ 4711 | | 584 | [227 | [5?3~ || 1 . 6e-34 



Description 

Sp : [LN : GNTR_BACSU] [AC:P10585] [GN : GNTR] [OR: BACILLUS SUBTILIS] [DE : GLUCONATE OP ERON 
TRANSCRIPTIONAL REPRESSOR (P28 PROTEIN)] [SP:P10585] [DB : swissprot] >pir : [LN : C26190] 

[ACrC26190 rA23537 rE69636 rS10723 ] [PN : transcription repressor of gluconate operon 
gntR:gnt operon regulatory protein] [GN:gntR ] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GIrdl022429rg563933] [LN:AB005554] [AC : ABO 055 54 : D4 5242 : D31629] [PNr gluconate 
operon repressor] [GN:gntR] [ORrBacillus subtilis] [SRrBacillus subtilis (strainrBGSC 
1A1) DNA] [DB :genpept-bctl] [DErBacillus subtilis genomic DNA, 36 kb region between gnt 
and ioloperons.] [NTrPROSITE; PS00043; HTH_GNTR_FAMILY; see SWISS_PROT] [LE:4516] 

[RE:5247] [DI : complement ] >gp : [GI :gl43014] [LNrBACGNT] [AC : J02584 :M24505] [PN:gnt 
repressor] [ORrBacillus subtilis] [SRrBacillus subtilis (strain Marburg 168) DNA] 

[DBrgenpept-bctl] [DE rB . subtilis (gluconate operon) gntR, gntK and gntP genes 
encodinggnt repressor, gluconate kinase and permease, and gntZ gene.] [LEr236] [RE r 967] 

[DIrdirect] >gp: [GI:ell84731rg2 636552] [LN r BSUB0021] [AC r Z99124 r AL009126] 

[PNr transcriptional regulator (GntR family)] [GNrgntR] [FNmegative regulation of the 
gluconate operon] [ORrBacillus subtilis] [DBrgenpept-bctl] [DEr Bacillus subtilis 
complete genome (section 21 of 21): from 3999281to 4214814.] [SPrP10585] [LErll3345] 

[RE r 114076] [DI r direct] 



NT AA 

ORF Name NT ID AAID — . T — , , score proba bility 

LengtnLengtn 

|M75030005S5_20fl92325_ci_i3a9 | |940 | [4712 |p3^~ | [51 | 

Description 

NO-HIT 



NT AA 

ORF Name NT ID AAID — — score probability 

Length Length u 

|AI7503000985_210Ba&_t2_690 | [941 | [ 4713 |p^-| | 73 | 

Description 

NO-HIT 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



IA17503000985 2125000 tl 305 



Description 
NO-HIT 



|942 | [4714 ||T?5 1 141 | 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000985 2135712 c3 2211 



] |943 | [4715 |[XT5T][398 |ggg ||1.5e-^T 



Description 

pirr [LNrH69784] [ACrH69784 ] [PN r chloramphenicol resistance protein homolog ydhL] 
[GNrydhL ] [CL : Streptomyces lividans chloramphenicol resistance protein] [ORrBacillus 
subtilis] [DBrpir2] >gpr [GI rell82559rg2 632893] [LN r BSUB0004] [AC r Z99107 r AL009126] 
[GNrydhL] [FNrunknown] [ORrBacillus subtilis] [DBrgenpept-bctl] [DErBacillus subtilis 
complete genome (section 4 of 21) r from 600701 to813890.] [NTrsimilar to 
chloramphenicol resistance protein] [LEr 24142] [RE r 25419] [DI r complement] 
>gpr [GIrdl020483rgl945096] [LNrD88802] [ACrD88802] [GNrydhL] [ORrBacillus subtilis] 
[SRrBacillus subtilis (sub_species : Marburg, strainrl68, isolate : JH642] 

[DBrgenpept-bctl] [DErBacillus subtilis DNA for phoB-rrnE-groESL region, complete cds.] 
[NTrS. lividans chloramphenicol resistance protein;] [LEr 11144] [RE r 12421] 
[DI r complement] 
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ORF Name 



IAI7503000985 214026 c2 1767 



Description 
NO-HIT 



NTID 



NT AA 
AAID — , — - score 
LengthLength 



J [944 | [4715 |[T25~"]|42 | 



probability 



ORF Name 



[AI7503000985 2148428 F2 892 



Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID „ — » — ^, score 
LengthLength 

[945 | |4717 ||123 1 14 0 I 



probability 



IAI7503000985 c2 1719 



Description 
NO-HIT 



NT AA 
NTID AAID — _^ — , . score 
LengthLength 



probability 



ORF Name 



^7503000985 21515707 £5 1084 



Description 
NO-HIT 



NT AA 
NTID AAID — ^, , — , _ scor e 
LengthLength 



][ 



5TT 



[4719 



probability 



ORF Name 



NTID 



NT AA 
AAID — — , , score 
LengthLength 



IAI7503000985 21517012 c2 1722 



] |948 | [4720 |pi~||135 \^ST 



probability 
1.8e-25 



Description 

gp: [GI:g2318065] [LN: AF012532] [AC : AF012532] [PNiYeeE] [GNryeeE] [OR : Bacillus subtilis] 
[DB:genpept-bctl] [DE: Bacillus subtilis strain 168 trpC2 YefA (yefA) gene, partial 
cds,and YefB (yefB) , YefC (yefC) , YeeA (yeeA) , YeeB (yeeB) , YeeC(yeeC), YeeD (yeeD) , 
YeeE (yeeE) and YeeF (yeeF) genes, completecds . ] [LE:8308] [RE:9417] [DI : complement] 



ORF Name 



IAI7503000985 21523377 13 1042 



Description 
NO-HIT 



ORF Name 



IAI7503000985 21553751 13 1315 



Description 
NO-HIT 



NT AA 
NTID AAID — ^ T — , score 
LengthLength 



probability 



] |949 | [4721 ||171~||56 | 



NTID 



!T5u~ 



_ _ NT AA 

AAID — — .score 

LengthLength 



probability 



|4722 ||237 | 



[73" 



ORF Name 



IAI7503000985 21579561 c2 1962 



Description 
NO-HIT 



NT AA 
NTID AAID — -„ — , , score 
LengthLength 



probability 



] [951 | [4 7 23 |p3~ | [ 40 | 



273 



ORF Name 



NT ID 



NT 



AA 



AAID _ . _ 



LengthLength 



score probability 



[Am0300098b 21604040 cl 1522 



-| |9S2 | [ 4 7 24 || 2028 "]|675 | p^F]| 4 . 4e- 132 



Description 

sp : [LN : FEOBJVtET JA] [AC:Q57986] [GN:MJ0566] [OR : METHANOCOCCUS JANNASCHII] [DE : FERROUS 
IRON TRANSPORT PROTEIN B HOMOLOG] [SP.-Q57986] [DB : swissprot] >pir : [LN:F64370] 
[AC:F64370 ] [PN: ferrous iron transport protein B] [CL: ferrous iron transport protein 
B: translation elongation factor Tu homology] [OR :Methanococcus jannaschii] [DB:pir2] 
[MP:REV504509-502503 ] >gp : [GI : gl5 912 72] [LN:U67505] [AC : U6 7505 : L77117 ] [PN:ferrous 
iron transport protein B (feoB)] [GN :MJ0566] [OR : Methanococcus jannaschii] 
[DB : genpept-bct2] [DE : Methanococcus jannaschii section 47 of 150 of the complete 
genome.] [NTrsimilar to SP:P33650 PID:606344 PID:1199515] [LE:10027] [RE:12033] 
[DI : complement] 



ORF Name 



NTID 



_ NT AA 
AAID — , — , , score 
LengthLength 



probability 



IAI7S03000985 21664126 c2 1712 



[353 | [4725 |pT3T] [360 | [1138 ||i . 9e-iiS 



Description 

gp: [GI:el299583 :g3687417] [LN: BLY17554] [AC:Y17554] [PN : ornithine carbamoyl trans f erase] 
[GNrarcB] [OR:Bacillus lichenif ormis] [DB : genpept-bctl] [EC:2.1.3.3] [DE:Bacillus 
licheniformis arcA, arcB, arcC and arcD genes.] [LE:1518] [RE: 2525] [DI: direct] 



ORF Name 



|AI750300098b 21674062 cl 1615 



Description 
NO-HIT 



ORF Name 



IAI7503000985 21678217 c2 1764 



Description 
NO-HIT 



NT 



AA 



NTID AAID — — . _ score probability 
LengthLength c u 



] |954 | [4726 ||135 1 144 | 



NTID 



„ TT ^ NT AA 

AAID — _ — , , score 

LengthLength 



j [4727 j[T?l~||40 "I 



probability 



ORF Name 



NT AA 
NTID AAID — _ „ — , scor e 
LengthLength 



1AI7503000985 21687963 cl 1426 



VST 



[4728 |p7~[[158 [[Im- 



probability 
3.4e-I5 



Description 

gp: [GI:g4980875] [LN: AE001717] [AC : AE001717 : AE000512 ] [PN:arginine repressor] 
[GN:TM0371] [OR : Thermo toga maritima] [DB :genpept-bct2] [DE : Thermotoga maritima section 
29 of 136 of the complete genome.] [NT: similar to GB:M27869 SP:P17893 PID: 142450] 
[LE:2550] [RE:3008] [DI:direct] >gp: [GI : el48 9641 :g5102818] [LN : TMA1322 86] [AC : AJ132286] 
[PN:arginine repressor] [GN:argR] [FN : regulation of arginine biosynthesis genes] 
[OR: Thermotoga maritima] [DB:genpept] [DE : Thermotoga maritima argR gene, strain MSB8 . ] 
[LE:1] [RE:459] [Dlrdirect] 



ORF Name 



NTID 



AAID 



E 



171503000985 21759718 ti 181 



][ 



NT AA 
LengthLength 
2 [4729 |p57 | [98 



score probability 
HZ 110.044 



Description 

gp: [GI:g4049770] [LN : AF063866] [AC : AF063866 ] [PN:ORF MSV254 leucine rich repeat gene 
family] [GN:MSV254] [OR : Melanoplus sanguinipes entomopoxvirus] [DB : genpep t - vr 1 ] 
[DE : Melanoplus sanguinipes entomopoxvirus, complete genome.] [LE: 220798] [RE: 221799] 
[DI : direct] 
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ORF Name 



NT 



AA 



^ MI5 Le - thLe ~ th score probability 



A1750300098b_21774087_c3_2iii | [958 | [ 4 7 30 || 732 | [243 | pTg~ | |1 . Be- 30 



Description 

sp: [LN:Y882_HAEIN] [AC:P44068] [GN:HI0882] [OR : HAEMOPHILUS INFLUENZAE] [DE : HYPOTHETICAL 

PROTEIN HI0882] [SP:P44068] [DB : swissprot] >pir : [LN : E64015] [AC:E64015 ] 

[PN: hypothetical protein HI0882] [OR : Haemophilus influenzae] [DB:pir2] 

>gp: [GI :gl573906] [LN:U32770] [AC : U32 770 : L4 2023] [PN:H. influenzae predicted coding 

region HI0882] [GN:HI0882] [OR : Haemophilus influenzae Rd] [DB :genpept-bct2] 

[DE : Haemophilus influenzae Rd section 85 of 163 of the complete genome.] 

[NT: hypothetical protein; identified by GeneMark;] [LE:1177] [RE:1914] [DIrdirect] 



ORF Name 



^7503000985 22275052 ti 248 



Description 
NO-HIT 



ORF Name 



E 



17503000985 22296927 ti 418 



Description 
NO-HIT 



ORF Name 



IAI7503000985 22304578 c2 1792 



Description 
NO-HIT 



ORF Name 



[AI7503000985 22345265 ti 27 



Description 
NO-HIT 



NTID 



^ m NT AA 

AAID — » — , , score 

Length Length 



probability 



4731 



NTID 

] eee: 



__ T „ NT AA 

AAID — , — . , score 

Length Length 

[4732 |p^B — [ |55 | 



probability 



NTID 



_ _ NT AA 

AAID — — score 

Length Length 



probability 



"| [961 | [4733 ||4B | 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



j [952 | [4734 |p^~| p~ 



ORF Name 



NTID 



_ NT AA 
AAID — — score 
LengthLength 



[AI7503 



00098b 22380343 t2 827 



][ 



[4735 "[[1983 | |SSu~ 



35T 



probability 
2.3e-30 



Description 

sp : [LN : ALYS_ENTFA] [AC:P37710] [OR : ENTEROCOCCUS FAECALIS] [SR :, STREPTOCOCCUS FAECALIS] 
[EC: 3. 5. 1.28] [DE : AUTOLYSIN, (N-ACETYLMURAMOYL-L- ALANINE AMIDASE) ] [SP:P37710] 
[DB: swissprot] >pir: [LN:A38109] [AC:A38109 ] [PN : autolysin] [OR : Enterococcus faecalis] 
[DB:pir2] >gp: [GI:g829194] [LN : STRHYDROLA] [AC:M58002] [PN : bacterial cell wall 
hydrolase] [OR : Enterococcus faecalis] [SR : Streptococcus faecalis DNA] [DB : genpept-bctl] 
[DE: Streptococcus faecalis bacterial cell wall hydrolase gene, completecds . ] [LE:536] 
[RE:2551] [DI :direct] 



NT AA 

ORF Name NTID AAID — - — _ score probability 
LengthLength c L - 



IAI7503000985 2242136 c3 2048 



964 | [4736 |p^]|227 | pi~ [|i . 2e-56 



Description 

gp: [GI:e303881:gl850807] [LN:CPCPEAA] [AC:X71844] [PN:putative transposase] 
[OR Clostridium perfringens] [DB : genpept-bctl] [DE : C . perf ringens uapC, cpe, and nadC 
genes.] [LE:2477] [RE: 2932] [DIrdirect] 
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NT AA 

QRF Name *™ AMD L e^thLe^th ^^ probability 

p7503000985J>24bil8&_t3_989 | |965 | |473V | |lb9 ~"|| 52 | 

Description 

NO-HIT 



ORF Name 



NTID 



„ T¥% NT AA 

AAID — . — , , score 

LengthLength 



E 



17503000985 22460302 cl 1511 



966 | [4738 |25i 



probability 
3.0e-30 



Description 

gp: [GI:g2735514] [LN : SCU96108] [AC:U96108] [PN:SceA precursor] [GN: sceAj 
[OR: Staphylococcus carnosus] [DB : genpept -bct2 ] [DE : Staphylococcus carnosus 
(3R) -hydroxymyristoyl acyl carrier proteindehydrase homolog (fabZ) gene, partial cds, 
YwpF homolog, single -strand binding protein homolog (ssb) , SceD precursor (sceD),SceA 
precursor (sceA) and SceE precursor (sceE) genes, completecds, and TenA homolog (tenA) 
gene, partial cds . ] [NT: secreted protein] [LE:2736] [RE: 3449] [DI: direct] 



ORF Name 



xttt,^ »*-r~ NT AA 

NTID AAID — — , - score 

LengthLength 



probability 



IAI7503000985 22655300 E2 881 



J [957 | [4739 ||1542 | [513 | [1508 ||1 . 2e-154 



Description 

gp: [GI:dl039113 :g4514332] [LN : AB013369] [AC : AB013369] [OR:Bacillus halodurans] 
[SR:Bacillus halodurans (strain : C-125) DNA] [DB :genpept-bctl] [DE:Bacillus halodurans 
C-125 yesT and comEC genes, partial andcomplete cds.] [NT:unknown] [LE:4328] [RE: 5830] 
[DI: direct] 



ORF Name 



IAI7503000985 22735877 El 1047 



Description 
NO-HIT 



ORF Name 



[AI7503000985 23438751 12 550 



Description 
NO-HIT 



NTID 



968 



NTID 



„ TO NT AA 

^ Le n^thLen^th ^°^ 



probability 



[4740 ||141 | 



_ _ NT AA 

AAID — — score 

LengthLength 



probability 



[4741 ||228 | 



IT 



ORF Name 



IAI7503000985 234432 ci 1440 



Description 
NO-HIT 



ORF Name 



AI7503000985_23445387_c3_i998 

Description 

NO-HIT 



_ ___ _ NT AA 

NTID AAID — _ „ — , _ score 

L en gthLength 



probability 



"j |970 | [4742 |[T5u~||49 | 



NTID 



AA 



NT 

AAID — _ — , . score 
LengthLength 



probability 
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NT AA 

0RF Name *™ L e^thLe^th ^^ probability 

|AlVb03000985jm68942J:lj;9 | | 972 | [ 4744 | | 1101 1 | 365 | | 928 | |3.4e-iU — [ 

Description 

pir: [LN:A69984] [AC:A69984 ] [PN: endo-1 , 4 -beta-glucanase homolog ysdC] [GNrysdC ] 
[CL: thermophilic aminopeptidase I alpha chain] [OR .-Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell84131:g2635347] [LN : BSUB0015] [AC : Z99118 : AL009126] [GNrysdC] [FNrunknown] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome {section 
15 of 21): from 2795131to 3013540.] [NTrsimilar to endo-1, 4 -beta-glucanase] [LE:154199] 
[RE:155284] [D I : complement] >gp: [GI : ell65304 : gl770012] [LN : BSZ75208] [AC:Z75208] 
[PN: hypothetical protein] [GN:ysdC] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis genomic sequence 89009bp.] [NT:homology to celA of Clostridium 
thermocellum and] [LE: 14623] [RE: 15708] [DI: direct] 

NT AA 

ORF Name NTID AAID — _^ — , _ score probability 
LengthLength c 



Al7503000985J234752bl_c2J.952 | | 973 | [ 4745 || 141 1 1 46 | 



Description 
NO-HIT 



ORF Name 



NTID 



^ Tr , NT AA 

AAID — — score 

LengthLength 



probability 



|Al7b03000985 2547532b cl 1479 



][ 



T7T 



Description 
NO-HIT 



|4746 |[T^ | |52 | 



ORF Name 



IAI7503000985 23476503 ±3 1228 



Description 
NO-HIT 



NTID 



][ 



NT AA 
AAID — — _ score 
LengthLength 



probability 



|4747 



I 



62 



ORF Name 



NTID 



NT AA 
AAID — — _ score 
LengthLength 



IA17503000985 23476702 cl 1496 



] I 976 | I 4748 |p^~||293 |pr 



probability 
6 . 6e-58 



Description 

sp: [LN:YWBI_BACSU] [AC:P39592] [GN : YWBI : IPA-24D] [OR:BACILLUS SUBTILIS] 
[DE: HYPOTHETICAL TRANSCRIPTIONAL REGULATOR IN THIK-EPR INTERGENIC REGION] [SP:P39592] 
[DB:swissprot] >pir : [LN: S39679] [AC : S39679 : G70051 ] [PN : transcription regulator homolog 
ywbI:protein ipa-24d] [GN:ywbI ] [CL:probable transcription regulator lsyR] 
[OR: Bacillus subtilis] [DB:pir2] >gp : [GI : g413948] [LN : BSGENR] [AC:X73124] [GN:ipa-24d] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] 
[SP:P39592] [LE:24460] [RE:25365] [DI:direct] >gp : [GI :ell86330:g2 636366] [LN : BSUB0020] 
[AC:Z99123 :AL009126] [GN:ywbI] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 20 of 21): from 3798401to 4010550.] 
[NT alternate gene name: ipa-24d; similar to] [SP:P39592] [LE: 132594] [RE: 133499] 
[DI : complement] 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



1AI7503000985 23479702 c3 2053 



Description 
NO-HIT 



j |977 | |4749 ||273 | |90 | 



277 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



1A17503000985 23480467 cl 1549 



][ 



TUT 



[ 4 7 50 || 512 ~||303 || 862 || 3.4e-86 



Description 

pir: [LN:G69879] [AC:G69879 ] [PN:L-serine dehydratase homolog ylpA] [GN:ylpA ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85177 :g2633958] [LN: BSUB0009] 
[AC:Z99112 :AL009126] [GN : ylpA] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT: similar to L-serine dehydratase] [LE: 59909] [RE: 60811] [DI: direct] 

>gp: [GI:e323528:g2337815] [LN : BSY13937] [AC:Y13937] [PN:putative YhaP protein] 

[GN: ylpA] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 

genomic DNA from the spoVM region.] [LE: 21681] [RE: 22583] [DI: direct] 



ORF Name 



|A17503 000y8b_23b205_c2_184 5 



Description 
NO-HIT 



NT AA 
NTID AAID — _ T — , , score 
LengthLength 



probability 



ORF Name 



03000985 23553275 ±3 1064 



Description 
NO-HIT 



NTID AAID 



][ 



NT AA 
LengthLength 
j |4752 ||T?5 j |4i | 



score probability 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score 



IAI7503000985 23556338 ci 1568 



] P 1 I |4753 |pji~||ioo [ pmr 



probability 
7.2e-13 



Description 

Sp: [LN : YCNE_BACSU] [AC:P94425] [GN: YCNE] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 10.9 

KD PROTEIN IN PHRC-GDH INTERGENIC REGION] [SP:P94425] [DB : swissprot] >pir : [LN : A69764] 
[AC:A69764 ] [PN : conserved hypothetical protein ycnE] [GN : ycnE ] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI :ell82354:g2 632688] [LN: BSUB0003] [AC : Z99106 : AL009126] [GN:ycnE] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis complete 

genome (section 3 of 21): from 402751 to611850.] [NT:similar to hypothetical proteins] 
[SP:P94425] [LE:36112] [RE:36399] [DI : complement ] >gp : [GI : dl009654 : gl805457] 
[LN:D50453] [AC:D50453] [GN:ycnE] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: 168 trpC2) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA for 25-36 degree 

region containing theamyE-srfA region, complete cds . ] [LE: 118515] [RE: 118802] 
[DI : complement] 



ORF Name 



^17503000985 23593800 ci 1493 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



] |982 | [4754 |p72~| |313 ||141 ||fl.3e-0fl 



Description 

pir: [LN:D69778] [AC:D69778 ] [PN: hypothetical protein ydeJ] [GN:ydeJ ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl020110 :gl881330] [LN : AB001488] [AC : AB001488] [GN:ydeJ] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] [DB : genpept -bet 1 ] 
[DE:Bacillus subtilis genome sequence, 148 kb sequence of the regionbetween 35 and 47 
degree.] [NT: FUNCTION UNKNOWN . ] [LE: 102509] [RE: 103168] [D I : complement] 
>gp: [GI:ell82488:g2632822] [LN: BSUB0003] [AC : Z99106 : AL009126] [GN:ydeJ] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete genome (section 
3 of 21): from 402751 to611850.] [LE:166254] [RE:166913] [DI : complement] 
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ORF Name 



|AI750300098E> 2^b9b262 t2 695 



Description 
NO-HIT 

ORF Name 



IAI7503000985 23595312 £3 948 



Description 
NO-HIT 



NT AA 
NTID AAID — — t _, score 
LengthLength 



probability 



Jj |983 | [4755 ||121~| |40 ~| 



NTID 
] [984 



NT AA 
AAID — , — score 
LengthLength 



probability 



|4756 ||132 | j¥T" 



ORF Name 



NTID 



NT 

AAID _ — 



AA 



LengthLength 



score 



E 



iVb03000985 23600412 F3 960 



] |985 | [4757 jpi7]|413 |pr 



probability 
i.0e-52 



Description 

pir: [LN:E69783] [AC:E69783 ] [PN :bicyclomycin resistance protein homolog ydgK] [GN:ydgK 
] [CL:bicyclomycin resistance protein] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI :dl020154 :gl881374] [LN : AB001488] [AC : AB001488] [GNiydgK] [OR:Bacillus subtilis] 
[SR: Bacillus subtilis (strain: 168) DNA] [DB : genpept-bctl] [DE: Bacillus subtilis genome 
sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] [NT : SIMILAR TO 
BICYCLOMYCIN RESISTANCE PROTEIN.] [LE:146860] [RE:148068] [DI:direct] 

>gp: [GI:ell82547:g2632881] [LN : BSUB0004] [AC : Z99107 : AL009126] [GN:ydgK] [FNrunknown] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
4 of 21): from 600701 to813890.] [NT:similar to bicyclomycin resistance protein] 
[LE:12656] [RE:13864] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength probability 



&I7B030009flB_235007S2_ti_2iS | |986 | [4758 |pTu~| | 79 || 73 | [ 



0.014 



Description 

pir: [LN:S69592] [AC:S69592 ] [PN : hypothetical protein YDR509w] [OR : Saccharomyces 
cerevisiae] [DB:pir2] [MP:4R ] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



1A17503000985 2360201B c2 1892 



] |987 | [ 4759 |p7~[| 228 | | 374 || 1.7e-34 



Description 

sp : [LN : YBBL_ECOLI] [AC:P77279] [GN : YBBL] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL ABC 
TRANSPORTER ATP -BINDING PROTEIN YBBL] [SP:P77279] [DB : swissprot] >pir : [LN : A64780] 
[AC:A64780 ] [PN:probable ABC-type transport protein ybbL:probable ABC transporter, 
ATP-binding protein ybbL] [GN:ybbL ] [CL : ATP-binding cassette homology] [OR Escherichia 
coli] [DB:pir2] >gp : [GI :gl773172] [LN: ECU82664] [AC:U82664] [OR : Escherichia coli] 
[DB: genpept-bctl] [DE : Escherichia coli minutes 9 to 11 genomic sequence.] 
[NT hypothetical protein] [LE:95367] [RE:96044] [DI:direct] >gp : [GI :gl786698] 
[LN:AE000155] [AC : AE000155 :U00096] [PN:putative ATP-binding component of a transport] 
[GN:ybbL] [FN: putative transport; Not classified] [OR : Escherichia coli] 
[DB:genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 45 of 400 of the 
completegenome.] [NT:o225; This 22 5 aa ORF is 32 pet identical (7 gaps)] [LE:443 9] 
[RE:5116] [DI:direct] 
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ORF Name 



NT AA 
NTID AAID — _ — _ score 
LengthLength 



[A1750300098b 23604052 tl 209 



[988 | [4750 jpifl | [IPS |pr 



probability 
10.0020 



Description 

sp: [LN:YE1A_METJA] [AC:P81328] [GN : MJ1417 . 1] [OR : ME THANO COCCUS JANNASCHII] 
[DE: HYPOTHETICAL PROTEIN MJ14 17 . 1] [SP:P81328] [DB : swissprot] >gp : [GI :g2826408] 
[LN:U67582] [AC : U6 7582 : L77117] [PN:M. jannaschii predicted coding region MJ1417.1] 
[GN:MJ1417 . 1] [OR : Methanococcus jannaschii] [DB : genpept-bct2] [DE :Methanococcus 
jannaschii section 124 of 150 of the complete genome J [NTrBrute Force ORF; identified 
by GeneMark; putative] [LE:4562] [RE:4999] [DI:direct] 



ORF Name 



^£7503000985 23G20252 ci 1606 



Description 
NO-HIT 



ORF Name 



IAI7503000985 23625005 c3 2103 



Description 
NO-HIT 



ORF Name 



[AI7503000985 23631311 c2 1820 



Description 
NO-HIT 



ORF Name 

|Ai7503000985_23635926J:2_538 

Description 

NO-HIT 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



[4761 |[TT8 | 



XTrT , T _ NT AA 

NTID AAID — » — t , score 

Length Length 



probability 



j |990 [ |4762 ||TTg | 



NTID 



^ m NT AA 

AAID — — , score 

LengthLength 



probability 



4763 



297 I [SS 



NTID 



7V7VTT , NT AA 

AAID — — , , score 

LengthLength 



probability 



[4754 ||T¥3 | |TT 



ORF Name 



IM7503000985 23636343 cl 1383 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



[4765 ||813 ||270 | [409 | |! 



probability 
3.4e-38 



Description 

sp: [LN:PHNE_ECOLI] [AC : P16683 : Q47479 : P76792 : Q47716] [GN : PHNE] [OR : ESCHERICHIA COLI] 
[DE : PHOSPHONATES TRANSPORT SYSTEM PERMEASE PROTEIN PHNE] 
[SP:P16683 : Q4 7479 : P76 792 :Q47716] [DB : swissprot] >pir : [LN:F35718] 

[AC:F35718:A42732:S56332:S56331:G65219:F65219 ] [PN:phnE protein : hypothetical protein 
b4103] [GNrphnE ] [CL.-phnE protein] [OR : Escherichia coli] [DBrpirl] >gp : [GI : gl47198] 
[LN : ECOPHNAQ] [AC:J05260] [OR : Escherichia coli] [SRrE.coli (strain B) DNA] 
[DB:genpept-bctl] [DErE.coli psiD locus containing alkylphosphonate uptake (phn) genes 
Athrough Q, complete cds . ] [NT :phnE protein] [LE:6227] [RE: 7057] [Dlidirect] 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



AI7503000985_23642942_c3_2087 

Description 

NO-HIT 



][ 



|4766 | |75~ 
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ORF Name 



NT ID 



NT 



AA 



AAID . , „ 



Length Length 



score 



2 |995 I [4767 [[2637 | [878 [[ZTT 



probability 
|9.2e-15 



Description 

gp: [GI:g4049717] [LN: AF063866] [AC : AF063866 ] [PN:ORF MSV156 hypothetical protein] 
[GN:MSV156] [OR : Melanoplus sanguinipes entomopoxvirus] [DB :genpept-vrl] [DE :Melanoplus 
sanguinipes entomopoxvirus, complete genome.] [LE:140126] [RE-.143509] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — _ — _ score 
Length Length 



probability 



IAI7503000985 23572518 t3 1051 



] P g | [4758 |pTT]p36 ||529 ||6.5e-5i 



Description 

pir: [LN:F69659] [AC:F69659 ] [PN : molybdopterin biosynthesis protein moeB] [GN:moeB ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85017 :g2633798] [LN: BSUB0008] 
[AC:Z99111:AL009126] [PN : molybdopterin biosynthesis protein] [GN: moeB] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE: Bacillus subtilis complete genome (section 8 of 21) : 
from 1394791to 1603020.] [LE:100770] [RE:101789] [DI: direct] >gp : [GI : g3282111] 
[LN:AF012285] [AC : AF012285 : AF012284 :U51911] [PN : molybdopterin biosynthesis protein 
MoeB] [GN : moeB] [OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis 
mobA-nprE gene region.] [NT: member of the moeB/hesA/thiF family; similar to] [LE:1258] 
[RE:2277] [DIrdirect] 



ORF Name 



NTID 



NT AA 

AAID — — . score probability 
LengthLength 



|AI7503000985_23680300_rl_330 



Description 
NO-HIT 



] I 557 1 |4759 1P^~| |315 | 



ORF Name 



IAI7503000985 23710811 ci 1425 



Description 
NO-HIT 



NTID AAID 

] 



NT 



AA 



4770 



LengthLength 



score probability 



1ED 
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ORF Name 



NT AA 
NTID AAID — __ — , score 
LengthLength 



probability 



IAI7503000985 23828253 tl 81 



Description 
NO -HIT 



ORF Name 



NTID AAID — — , score 
LengthLength 



probability 



IAI7503000985 23866552 t2 815 



J [1000 | [4772 |pTI | [46 [ 



Description 
NO-HIT 



ORF Name 



NTID AAID . — — , _ score 
LengthLength 



probability 



AI7503000985JS3868887_c2_1828 

Description 

NO-HIT 



] [1001 I [4773 | 



1189 I WI 



281 



NT AA 

ORFName NTID AAID Le ~ thLe — th probability 



|Ai7S0300096&J*401!>ba7_clJ.475 | [ 1002 | [4774 | |152i | p5 1 [ 2197 | [i . le-22V 

Description 

gp: [GI :g4574120] [LN : AF009415] [AC : AF009415] [PN:glycine betaine aldehyde 
dehydrogenase] [GN: cudA] [OR : Staphylococcus xylosus] [DB : genpept-bct2] 
[DE : Staphylococcus xylosus choline transporter (cudT) , putativeregulatory protein 
(cudC) , glycine betaine aldehyde dehydrogenase (cudA) , and choline dehydrogenase (cudB) 
genes, complete cds . ] [NT : CudA] [LE:3363] [RE:4856] [DI:direct] 

NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength c L - 

AI7S030009flS_240i$026_tl_330 | [1003 | [ 4775 | |156 | |5i | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

|AI75030009ab_24022177_ci_157b | [1004 | [ 4776 ||rTgg~| | 452 | [ 1499 | | i . le-lbJ | 

Description 

Sp : [LN : GNTP_BACSU] [AC:P12012] [GN : GNTP] [ORiBACILLUS SUBTILIS] [DE : GLUCONATE PERMEASE] 
[SP:P12012] [DB:swissprot] >pir : [LN : A26190] [AC : A26190 : D6 9636 ] [PN: gluconate permease 
gntP] [GN:gntP ] [CL:D- serine permease] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl022427:g563931] [LN : AB005554] [AC : AB005554 : D45242 : D3 1629] [PN : gluconate 
permease] [GNrgntP] [ORrBacillus subtilis] [SRrBacillus subtilis (strain:BGSC 1A1) DNA] 
[DB:genpept-bctl] [DErBacillus subtilis genomic DNA, 36 kb region between gnt and 
ioloperons.] [NTrhomologs are found in E. coli and H. influenzae;] [LE:1607] [RE:2953] 
[DI : complement] >gp: [GI:gl43016] [LN : BACGNT] [AC: J02584 :M24505] [PN : permease] 
[OR: Bacillus subtilis] [SR: Bacillus subtilis (strain Marburg 168) DNA] 
[DB :genpept-bctl] [DE :B . subtilis (gluconate operon) gntR, gntK and gntP genes 
encodinggnt repressor, gluconate kinase and permease, and gntZ gene.] [LE:2530] 
[RE:3876] [DI:direct] >gp: [GI :ell84733:g2 636554] [LN:BSUB0021] [AC : Z99124 : AL009126] 
[PN: gluconate permease] [GN:gntP] [FN: gluconate utilization] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 21 of 21) : from 
3999281to 4214814.] [SP:P12012] [LE:115639] [RE:116985] [DI: direct] 

NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength c *■ 



|Al7B030009flS_24070ib6_t2_fl7fl | [1005 | [ 4777 | 164 || 2S0 || i.6e-24 



Description 

pir: [LN:A69849] [AC:A69849 ] [PN : hypothetical protein yjdF] [GN:yjdF ] [ORrBacillus 
subtilis] [DB:pir2] >gp: [GI :ell83223:g2 633557] [LN : BSUB0007] [AC : Z99110 : AL009126] 

[GN:yjdF] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [LE:80891] [RE:81373] 

[DI : direct] 



282 



ORF Name 



NTID 



NT 



AA 



AAID _ ^ _ 



Length Length 



score 



F 



1750300098b ^4087760 C2 1742 



^ [loos | [4778 |prn— |prr 



probability 
|2.5e-65 



Description 

pir: [LN:F69997] [AC:F69997 ] [PN : hypothetical protein ytnM] [GN:ytnM ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell84177 :g2635393] [LN: BSUB0015] [AC : Z99118 : AL009126] 
[GNrytnM] [FN:unknown] [OR:Bacillus subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 15 of 21): from 2795131to 3013540.] [LE:202739] [RE:203641] 
[DI: complement] >gp: [GI : ell85801 : g2635412] [LN: BSUB0016] [AC : Z99119 : AL009126] [GN:ytnM] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpep t - be 1 1 ] [DE:Bacillus subtilis complete 
genome (section 16 of 21): from 2997771to 3213410.] [LE:99] [RE:1001] [DI : complement] 
>gp: [GI:g2293257] [LN : AF00822 0] [AC : AF008220] [PN: YtnM] [GN:ytnM] [OR:Bacillus 
subtilis] [DB: genpep t-bct2] [DE:Bacillus subtilis rrnB-dnaB genomic region.] 
[NT: similar to a hypothetical protein] [LE : 179426] [RE: 180328] [DI: direct] 



ORF Name 



NTID AAID 



NT 



AA 



Length Length 



score probability 



|A:i:7503000985_24095327_c2_i770 

Description 

NO-HIT 



J [1007 | [4775 |p5~ 



ORF Name 



E 



NT AA 
NTID AAID — _ — , score 
LengthLength 



probability 



17503000985 24101701 cl 1485 



] [1008 | [4780 ||i54r||bl3 | p3^]|2 . 9e-158 



Description 

gp: [GI:dl039113:g4514332] [LN : AB013369] [AC : AB01336 9] [OR:Bacillus halodurans] 
[SR:Bacillus halodurans (strain : C-125) DNA] [DB : genpept -bctl ] [DE:Bacillus halodurans 

C-125 yesT and comEC genes, partial andcomplete cds . ] [NT: unknown] [LE:4328] [RE: 5830] 
[DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — « — , , score 
LengthLength 



probability 



^7503000985 24220260 c3 2109 



] [1009 | [4781 |pu3T]p5i ||121 ||0. 00034 



Description 

gp: [GI:e316518:g2230824] [LN : DDSTATFIR] [AC:Y13097] [PN : STAT protein] [GN : dstA] 
[FN: regulates stalk cell differentiation] [OR : Dictyostelium discoideum] 

[DB: genpept- invl] [DE :D . discoideum mRNA for stat protein, first finger stage.] [LE:507] 
[RE: 2630] [DI:direct] 



ORF Name 



NTID 



NT AA 
AAID — — , score 
LengthLength 



17503000985 24220290 t2 582 



TUTTT 



[4782 |pre2~||493 



probability 
S.2e-94 



Description 

Sp: [LN : ALDA_ECOLI] [AC:P25553] [GN : ALDA : ALD] [OR : ESCHERICHIA COLI] [EC : 1 . 2 . 1 . 22] 

[DE: ALDEHYDE DEHYDROGENASE A, ( LACTALDEHYDE DEHYDROGENASE)] [SP:P25553] [DB : swissprot] 



ORF Name 



[A17503000985_24224037_c2_i825 

Description 

NO-HIT 



ORF Name 



AI7503000985_242250i5_£3_i098 

Description 

NO-HIT 



NTID AAID — . , score 
LengthLength 



NTID 



probability 



] I 1011 I | 4783 IF*~ 



„ T „ NT AA 

AAID — — , score 

LengthLength 



probability 



] [1012 | [4784 |p5~ ||55 | 



283 



ORF Name 



NT ID 



NT AA 
AAID — , — ^, score 
LengthLength 



probability 



lAI750300098b 2422b01b t3 1127 



1013 | [4785 ||141—||46 | 



Description 
NO-HIT 



ORF Name 



NT ID 



NT 



AA 



AAID , « _ — . _ 

LengthLength 



score 



r 



17503000985 24226577 c3 2184 



[4786 ||g35~||277 |[X5T 



probability 
9.4e-ll 



[CL: transcription repressor glnR] 
[ LN : BMMERAR2 ] [AC:Y09907] 



Description 

sp : [LN : MERR_BACSR] [AC:P22853] [GN:MERR] [ORiBACILLUS SP] [SR:RC607,] [DE : MERCURIC 
RESISTANCE OPERON REGULATORY PROTEIN] [SP:P22853] [DB : swissprot] >pir : [LN : A3222 7] 
[AC:A32227 ] [PN : hypothetical protein 1 (mer operon) ] 
[OR:Bacillus sp.] [DB:pir2] >gp : [GI : e301549 :g2995399] 
[PN: regulatory protein] [GN:merR] [OR: Bacillus megaterium] [DB :genpept-bctl] 
[DE: Bacillus megaterium ORF2, ORF3 , ORF4 , merR and merA genes.] [LE.-574] [RE: 972] 
[DI:direct] >gp: [GI :gll29093] [LN:AF138877] [AC : AF138877 :M22708 :M22709 : AH003258] 
[PN:mercury resistance operon negative regulator] [GN:merRl] [OR:Bacillus sp. RC607] 
[DB:genpept-bct2] [DE:Bacillus sp . RC607 mercury resistance operon, complete sequence.] 
[LE:827] [RE: 1225] [DI:direct] 



ORF Name 



11 7 503000985 24250177 cl 1630 



NT AA 
NT ID AAID — _ — _ score 
LengthLength 



probability 



J [1015 | [4787 |[7TT-||237 | pi - 1|4 . 7e-64 



Description 

gp: [GI:el429599:g4756153] [LN:A67161] [AC:A67161] [FN:NARJ GENE] [OR : Staphylococcus 
carnosus] [DB :genpept-pat] [DE: Sequence 1 from Patent EP0805205.] [NT:unnamed protein 
product] [LE:9374] [RE:9949] [DI:direct] >gp : [GI :g3929524] [LN : AF029224] 
[AC:AF029224:AF029225] [PN:NarJ] [GN:narJ] [OR : Staphylococcus carnosus] 

[DB : genpept -bct2 ] [DE : Staphylococcus carnosus nir and nar operons, complete sequences.] 
[NT: similar to Escherichia coli nitrate reductases NRA] [LE: 11559] [RE: 12134] 
[DI: direct] 



ORF Name 



NT ID 



AAID 



IA17503000985 242512b! c2 1868 



Description 
NO-HIT 

ORF Name 



TUTE~ 



NT AA 
LengthLength 
[4788 ||129 | [42 | 



— . , score probability 



NT ID 



NT 



AA 



UC7503000985 24251400 t2 508 



TUTT 



^ Len^thLen^th ^g probability 
T5T 



2~ST~ 



721 ||2.9e-71 



Description 

gp: [GI:gl854577] [LN : STALYTS] [AC:L42945] [GN: lytR] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE : Staphylococcus aureus lytS and lytR genes, complete cds . ] 
[LE:1849] [RE:2589] [DI:direct] 



ORF Name 



>TfT1TT ^ , ,™ NT AA 

NTID AAID — , — , score 

LengthLength 



probability 



AI750300098b_24251635_t3_1143 

Description 

NO-HIT 



|479Q |[T21 | [40 I 



284 



ORF Name 



NTID 



NT AA 
AAID — . T — u , score 
LengthLength 



probability 



IAI7503000985 24261062 ci 1455 



1019 | [4791 ||1773||590 | [1994 ||3 . 7e-206 



Description 

pir: [LN:F70040] [AC:F70040 ] [PNisulfite reductase homolog yvgQ] [GNryvgQ ] [CL:sulfite 
reductase (f erredoxin) ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell86031 :g2635856] 
[LN:BSUB0018] [AC : Z99121 : AL009126] [GN: yvgQ] [FNrunknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 18 of 21) : from 
3399551to 3609060.] [NT:similar to sulfite reductase] [LE:30110] [RE:31825] 
[DI : complement] 



ORF Name 



NTID 



_ _ NT AA 



117303000985 24269812 11 414 



] [1020 | [4792 |p7~|pTTF 



probability 
4.7e-25 



Description 

gp: [GI:el312907:g3355681] [LN:SC1C2] [AC : AL031124] [PN : hypothetical protein SC1C2 . 14c] 
[GN:SC1C2 .14c] [OR : Streptomyces coelicolor] [DB : genpept-bctl] [DE : Streptomyces 
coelicolor cosmid 1C2 . ] [NT: SC1C2 . 14c, unknown, len: 185 aa; similar to] [LE: 14959] 
[RE : 15516] [DI : complement] 



ORF Name 



™„ NT AA 
NTID AAID — — score 
Length Length 



probability 



IA17S03000985 24273375 c2 1749 



] [1021 I [4793 |pnr-|| 199 | [7g9~ || 1 . 8e-78 



Description 

gp: [GI :g4574118] [LN : AF009415] [AC : AF009415] [PN:choline transporter] [GN : cudT] 
[OR staphylococcus xylosus] [DB :genpept-bct2] [DE : Staphylococcus xylosus choline 
transporter (cudT) , putativeregulatory protein (cudC) , glycine betaine aldehyde 
dehydrogenase (cudA) , and choline dehydrogenase (cudB) genes, complete cds . ] [NT: CudT] 
[LE:811] [RE:2433] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — — score 
LengthLength 



probability 



1AI7503000985 24275187 12 858 



1022 | [4794 ||T?g~~||42 "[ 



Description 
NO-HIT 



ORF Name 



IA27503000985 24303775 12 444 



Description 
NO-HIT 



NTID 



7v7vxr ^ NT AA 
^ LengthLength 



probability 



TU7T 



[4795 [[165 | [54 | 



ORF Name 



^17503000985 24304187 c2 1728 



NTID 



AAID 



Description 
NO-HIT 



TT9F" 



NT AA 
LengthLength 
153 I |5B — 



score probability 



285 



ORF Name 



NTID 



NT 

AAID . — 



AA 



LengthLength 



score probability 



U7503000985 24J04712 ±1 311 



1025 | [ 4 7 9 7 | [16 7 4 "]|557 | [1015 || 2 . le-102 



.74] 



Description 

sp: [LN:DCIP_ENTCL] [AC:P23234] [GN:IPDC] [OR : ENTEROBACTER CLOACAE] [EC:4.1.1, 
[DE : DECARBOXYLASE ) ] [SP:P23234] [DB : swissprot] >pir : [LN: S16013] [AC:S16013 ] 
[PN: indolepyruvate decarboxylase,] [CL : acetolactate synthase large chain : thiamine 
pyrophosphate-binding domain homology] [OR : Enterobacter cloacae] [EC: 4. 1.1.-] [DB:pir2] 
>gp: [GI:dl014947:g216677] [LN : ENTIPDC] [AC:D90214] [OR : Enterobacter cloacae] [SR:E. 
cloacae (strain FERM BP-1529) genomic DNA] [DB :genpept-bctl] [DE:E. cloacae gene for 
indolepyruvate decarboxylase.] [NT : indolepyruvate decarboxylase] [LE:31] [RE: 1689] 
[DI : direct] 



ORF Name 



NTID AAID — — score probability 



[AI7S0 



300098!. 24333286 t2 802 



Description 
NO-HIT 



NT AA 
LengthLength 
[1026 | [4798 ||T?5 | |£T 



ORF Name 

|A17b03000985_2433779i_c3_2068 

Description 

NO-HIT 



1027 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



|4799 |p^3~| |T7u~ 



ORF Name 



NT 

NTID AAID 



AA 



LengthLength 



score 



|4800 ||1890|p9 |[T75F 



probability 
14 . 6e-185 



[AI7503000985_24337807_c3_207g 

Description 

pir: [LN:G70040] [AC:G70040 ] [PN:sulfite reductase homolog yvgR] [GN:yvgR ] [CL:sulfite 
reductase (NADPH) : f lavodoxin homology: NADPH- - f errihemoprotein reductase homology] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell86032 :g2635857] [LN:BSUB0018] 
[AC:Z99121:AL009126] [GN:yvgR] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 18 of 21): from 3399551to 3609060.] 
[NT:similar to sulfite reductase] [LE:31851] [RE:33668] [DI : complement] 



ORF Name 



NTID AAID 



117503000985 24338217 c2 1747 



TU7T 



NT AA 
LengthLength 
597 I |TSB~" 



score 



T7T 



probability 
2.8e-34 



Description 

gp: [GI :g4098082] [LN: LLU73336] [AC:U73336] [PN: anaerobic ribonucleotide reductase 
activator] [GN:nrdG] [OR : Lactococcus lactis] [DB :genpept-bct2] [DE : Lactococcus lactis 
anaerobic ribonucleotide reductase (nrdD) andanaerobic ribonucleotide reductase 
activator protein (nrdG) genes , complete cds . ] [NT :NrdG] [LE:2413] [RE: 3012] [DI:direct] 



ORF Name 



H7503000985 24351562 ±3 1230 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



Description 
NO-HIT 



][ 



TUTtT 



[4802 



286 



NT AA 

ORF Name NTID AAID Le ^ t fr Le ^ t fr score probability 



AI750300098b_243bJ390_c2_1882 | [ 1031 | [4803 |pg"~ ||231 || 42i || l.Be-3S 



Description 

pir : [LN:S76993] [AC:S76993 ] [PN: hypothetical protein] [CL:ribitol 
dehydrogenase: short -chain alcohol dehydrogenase homology] [OR: Syne cho cyst is sp.] 
[SR:PCC 6803, , PCC 6803] [SR:PCC 6803, ] [DB:pir2] >gp : [GI : dl011336 : gl001805] 
[LN : SYCSLRG] [AC : D64005 : ABO 013 3 9] [PN : hypothetical protein] [OR : Synechocystis sp.] 
[SR:Synechocystis sp. (strain : PCC6803 ) DNA] [DB : genpept-bctl] [DE : Synechocystis sp. 
PCC6803 complete genome, 24/27, 3002966-3138603.] [NT : ORF_ID : slr0315] [LE:34344] 
[RE: 35078] [DI :direct] 



NT AA 

ORF Name NTID AAID — . T — , , score probab ility 
Length Length 

IAI7503000985 24353392 c2 1554 



TUTT 



[4804 |puT]|802 |p358||0.0 



Description 

gp: [GI:el429587:g4756149] [LN:A67161] [AC:A67161] [FN : NIRB GENE] [OR : Staphylococcus 
carnosus] [DB :genpept-pat] [DE: Sequence 1 from Patent EP0805205.] [NT:unnamed protein 
product] [LE:181] [RE:2586] [DI: direct] >gp : [GI :g4433639] [LN : AF029224] 
[AC:AF029224 :AF029225] [PN: NirB] [GN:nirB] [OR : Staphylococcus carnosus] 

[DB : genpept - be t 2 ] [DE : Staphylococcus carnosus nir and nar operons, complete sequences.] 
[NT:similar to Escherichia coli NADH- dependent nitrite] [LE:2366] [RE:4771] [DI:direct] 

NT AA 

ORF Name NTID AAID — _ — , , score probability 
LengthLength c *- 



&I7S0300098g_24406B42J:3_i293 | [1033 | [4805 |p^~[ [112 ||513 ||3.2e-49 



Description 

gp: [GI:g2914128] [LN: SEU43366] [AC:U43366] [PN:IcaD] [GN : icaD] [OR : Staphylococcus 
epidermidis] [DB : genpept -bet 2] [DE : Staphylococcus epidermidis operon mediating 
intercellular adhesion: I caR, IcaA, IcaD, IcaB, and IcaC genes, complete cds . ] [LE:1963] 
[RE:2268] [DI: direct] 

NT AA 

ORF Name NTID AAID — ^ r — , , score probability 

Length Length 



IAI7503000985 244059b^> cl 1434 



TTJ7F 



[4806 ||447 I |T33~ 



TF5" 



.9e-i5 



Description 

Sp : [LN: PETP_RHOCA] [AC:P31078] [GN : PETP] [OR : RHODOBACTER CAPSULATUS] 
[SR: , RHODOPSEUDOMONAS CAPSULATA] [DE : PETP PROTEIN] [SP:P31078] [DB : swissprot] 
>pir: [LN:S22631] [AC : S22631 : S21001 ] [PN:petP protein] [GN:petP ] [OR : Rhodobacter 
capsulatus] [DB:pir2] >gp : [GI : e49248 :gl333802] [LN:RCPETPR] [AC : Z12113 : S42 067] 
[PN: protein of unknown function] [GN:petP] [OR: Rhodobacter capsulatus] 
[DB: genpept-bctl] [DE :R. capsulatus petP, petR, and fbcF genes.] [NT:part of the petPR 
operon in front of fbc operon] [SP:P31078] [LE:199] [RE: 699] [DI: direct} 

NT AA 

ORF Name NTID AAID — , — , , score probability 
LengthLength c *~ 



|AI7S03000986ja441SS87_c2_1761 | [1035 | [ 4807 || 927 "^| [308 | p3~ || 2 . 7e-05 

Description 

gp: [GI:g3955198] [LN : AF022796] [AC : AF022796] [PN:ModA] [GN:modA] [OR : Staphylococcus 
carnosus] [DB : genpept -bct2] [DE : Staphylococcus carnosus molybdenum cof actor 
biosynthetic genecluster, complete sequence.] [NT : molybdate -binding lipoprotein of the] 
[LE:109] [RE: 894] [DI: direct] 



287 



NT AA 

QRF Name I?™ AAI5 Le — thLe — th score probability 



IAI75030009B5 244160^8 el 1524 



1035 | [ 4808 | p5~ ||182 | p5~ || 8 . 3e-51 



Description 

pir: [LN:C69996] [AC:C69996 ] [PN : conserved hypothetical protein ytml] [GNrytml ] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell84188 :g2635404] [LN : BSUB0015] 
[AC:Z99118 :AL009126] [GN:ytmI] [FNrunknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[NT: similar to hypothetical proteins from B. subtilis] [LE: 212192] [RE: 212728] 
[DI: complement] >gp: [GI:ell85812 :g2 635423] [LN : BSUB0016] [AC : Z99119 : AL009126] [GN:ytmI] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 16 of 21): from 2997771to 3213410.] [NT:similar to hypothetical 

proteins from B. subtilis] [LE:9552] [RE:10088] [DI : complement] >gp : [GI :g2293246] 
[LN:AF008220] [AC : AF008220] [PN:YtmI] [GN:ytmI] [OR:Bacillus subtilis] 
[DB : genpept-bct2] [DE:Bacillus subtilis rrnB-dnaB genomic region.] [NT:similar to a 

hypothetical 19 kD protein from B . ] [LE: 170339] [RE: 170875] [DI: direct] 

NT AA 

QRF Name NT ID AAID Le ~ thLe ~ th score probability 



IAI7503000385 24421537 c3 1559 



[1037 | [4805 |pF5"||422 ||778 ||2.7e-77 



Description 

pir: [LN:B69801] [AC:B69801 ] [PN : chloramphenicol resistance protein homolog yfhl] 
[GN:yfhI ] [CL:Streptomyces lividans chloramphenicol resistance protein] [OR: Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell82844:g2 633178] [LN : BSUB0005] [AC : Z99108 : AL009126] 
[GN:yfhI] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 5 of 21): from 802821 tol011250.] [NT:similar to antibiotic 
resistance protein] [LE:123573] [RE:124766] [DI:direct] 



NT AA 

QRF Name NT ID AAID Le ~ thLe ~ th score probability 



|AI7S0300098S_24429653_ci_i444 | [1038 | |481P~~ |pTT~| | 310 || 325 || 2.ie-29 | 

Description 

sp : [LN : YXDK_BACSU] [AC:P42422] [GN : YXDK : B6 5E] [OR:BACILLUS SUBTILIS] [EC:2.7.3.-] 
[DE: (EC 2.7.3.-)] [SP:P42422] [DB : swissprot ] >pir : [LN : H70073] [AC:H70073 ] 
[PN: two -component sensor histidine kinase homolog yxdK] [GN : yxdK ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :dl 003811 :g709992] [LN:BACIOLO] [AC:D14399] [PN : hypothetical 
protein] [GN : B65E] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:BGSC 1A1 (168 
trpC2)) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis 15 kb chromosome segment contains 
the iol operon.] [NT homologous to sensor protein BvgC, His protein] [LE:11893] 
[RE:12870] [DI:direct] >gp : [GI :ell84690:g2 636511] [LN : BSUB0021] [AC : Z99124 : AL009126] 
[GN:yxdK] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 21 of 21): from 3999281to 4214814.] [NT:similar to 
two-component sensor histidine kinase] [SP:P42422] [LE: 70819] [RE: 71796] 
[DI : complement] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



|&:t75030009BB_24432327_c2_1777 | [1039 | [4811 |p^"| |554 | piT] |6 . 7e-i68 | 



Description 

gp: [GI:g473902] [LN : LACALS] [AC:L16975] [PN : alpha-acetolactate synthase] [GN:als] 
[OR:Lactococcus lactis] [SR :Lactococcus lactis (strain DSM 20384, sub_species lactis) 

DNA] [DB: genpept-bctl] [DE : Lactococcus lactis alpha-acetolactate synthase (als) gene, 

completecds. ] [LE:1232] [RE:2896] [DI: direct] >gp : [GI :g809618] [LN:A23961] [AC:A23961] 
[PN: alpha-acetolactate synthase] [OR : Lactococcus lactis] [DB :genpept-pat] [DE : L . lactis 

alpha-acetolactate synthase gene.] [LE:550] [RE: 2214] [DI: direct] 



288 



NT AA 

ORFName NTID AAID Le ~ thLe ~ th ££^e probability 

[AI75030009ab_24b00i00_cl_1462 | [1040 | [ 4812 |pTTT] pT | p08 | |7 . 3e-m | 

Description 

pir: [LN:B69877] [AC:B69877 ] [PN:sulfate adenylyltransf erase homolog ylnB] [GNrylnB ] 
[CL: Synechocystis sulfate adenylyltransf erase : sulfate adenylyltransf erase homology] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e332185 :g2462958] [LN: BSPYREYLO] [AC : AJ000974] 
[PNrputative sulfate adenylyltransf erase] [GN:ylnB] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis pyrE to yloA gene region.] [LE:2374] [RE:3522] 
[DI: direct] >gp: [GI :ell85151:g2 633932] [LN: BSUB0009] [AC : Z99112 : AL009126] [GNrylnB] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 9 of 21): from 1598421to 1807200.] [NT:similar to sulfate 

adenylyltransf erase] [LE : 33187] [RE : 34335] [DI : direct] 



NT AA 

ORF Name NTID AAID — _ — , score probability 
LengthLength c JL 



A17b0300098i>_a45443_ci_154a | [1041 | [4813 |p^~| | 231 || 375 || 1.4e-34 



Description 

pir: [LN:F69879] [AC:F69879 ] [PN : phosphoglycerate dehydrogenase homolog yloW] [GN:yloW 
] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85176 :g2633957] [LN : BSUB0009] 
[AC:Z99112:AL009126] [GN:yloW] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:similar to phosphoglycerate dehydrogenase] [LE:59221] [RE:59883] [DI:direct] 
>gp: [GI:e323511:g2337814] [LN : BSY13937] [AC:Y13937] [PN:putative YhaQ protein] 
[GN:yloW] [FN: unknown] [OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis 
genomic DNA from the spoVM region.] [LE:20993] [RE:21655] [DI:direct] 

NT AA 

ORF Name NTID AAID — — ^ score probability 
LengthLength 



AI75030009S5_24611567_c2_lfl3i | [1042 [ [ 4814 || 107^ ] | 357 ||T7B~|| 1. 7e-4S 



Description 

pir: [LN:S36209] [AC:S36209 ] [PN:dep protein precursor] [GN:dep ] [OR:Bacillus 
anthracis] [DB:pir2] >gp : [GI : dl003632 :g436034] [LN:BACDEP] [AC:D14037] [PN:ORF] 
[GN:dep] [OR:Bacillus anthracis] [SRrBacillus anthracis (strain : Davis) plasmid :pTE702 
DNA] [DB:genpept-bctl] [DE:Bacillus anthracis plasmid pTE702 dep gene for ORF, complete 
cds.] [LE:252] [RE:1652] [DI: direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|AI75030009S5_24gi7025_cl_i339 | [1043 | [4815 | p^~| | 80 | 

Description 

NO-HIT 



289 



ORF Name 



NTID 



NT 



AA 



AAID . . , , 



LengthLength 



score 



|A17j>0300098h 24617262 cl 1501 



1044 | |4816 1 1274 | JS"57r 



probability 
5.7e-6S 



Description 

sp : [LN : PANB_BACSU] [AC:P52996] [GN: PANB] [OR:BACILLUS SUBTILIS] [EC : 2 . 1 . 2 . 11] 
[DE: (KETOPANTOATE HYDROXYMETHYLTRANSFERASE) ] [SP:P52996] [DB : swissprot] 

>pir: [LN:G69671] [AC:G69671 ] [PN : ketopantoate hydroxymethyl transferase panB] [GNrpanB 
] [CL: 3-methyl-2-oxobutanoate hydroxymethyl transferase] [OR:Bacillus subtilis] 
[DB:pir2] >gp : [GI :gll46240] [LN : BACYPIA] [AC:L47709] [PN : ketopantoate 
hydroxymethyltransf erase] [GNipanB] [FN: pantothenic acid biosynthesis] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [EC : 2 . 1 . 2 . 11] [DE:Bacillus subtilis (clone YAC15-6B) ypiABF 
genes, qcrABC genes , ypjABCDEFGHI genes, birA gene, panBCD genes, dinG gene, ypmB 
gene,aspB gene, asnS gene, dnaD gene, nth gene and ypoC gene, completecds ' s . ] [NT: 4 7.1% 
of identity to the 3-methyl-2-oxobutanoate] [LE:13293] [RE:14126] [DI:direct] 
>gp: [GI:ell83688:g2634661] [LN : BSUB0012] [AC : Z99115 : AL009126] [PN : ketopantoate 
hydroxymethyl transferase] [GN :panB] [FN pantothenate biosynthesis] [OR: Bacillus 
subtilis] [DB:genpept-bctl] [EC : 2 . 1 . 2 . 11] [DE: Bacillus subtilis complete genome 
(section 12 of 21): from 2195541to 2409220.] [SP:P52996] [LE:157541] [RE:158374] 
[DI : complement] 



ORF Name 



NTID 



„ T „ NT AA 

AAID — — score 

LengthLength 



probability 



IAI7503000985 24640510 c2 1654 



1045 | [4817 |pT5~| [T77 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



1AI750300098S 24641932 c2 1734 



1046 | |4fll8 |[g72~||223 ||B00 j|7.7e-48 



Description 

gp: [GI:g4104595] [LN: AF036964] [AC : AF036 964] [PN:putative response regulator] [GN:rrpl] 
[OR: Lactobacillus sakei] [DB : genpept -bct2 ] [DE : Lactobacillus sake putative response 
regulator (rrpl) and putativehistidine kinase (hpkl) genes, complete cds . ] [NT : Rrpl ; 
member of a two-component regulatory system] [LE:2112] [RE: 2786] [DI: direct] 



ORF Name 



NTID 



„ Tr , NT AA 

AAID — — .score 

LengthLength 



IAI7503000985 24643930 c3 2138 



TuTT 



TIT 



3W 



probability 
3.9e-37 



KD 



Description 

sp: [LN:YOHK_ECOLI] [AC:P33373] [GN : YOHK] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 24.5 
PROTEIN IN PBPG-CDD INTERGENIC REGION] [SP:P33373] [DB : swissprot] >pir : [LN: E64982] 
[AC:E64982 ] [PN:yohK protein] [GN : yohK ] [CL:yohK protein] [OR: Escherichia coli] 
[DB:pir2] >gp : [GI : gl788464] [LN: AE000303] [AC : AE000303 : U00096] [PN:putative seritonin 
transporter] [GN : yohK] [FN: putative transport; Not classified] [OR Escherichia coli] 
[DB: genpept -bet 2] [DE : Escherichia coli K-12 MG1655 section 193 of 400 of the 
completegenome. ] [NT:o231; residues 10295 are 100 pet identical to] [LE.-6088] [RE: 6783] 
[DI : direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



|AI7503000^85 24646363 c3 2072 



[ 4820 || 135 TT| |449 | p7uT]| 2 . 6e-175 



Description 

gp: [GI:e315090:g2791905] [LN : SSK3MECA1] [AC:Y13052] [GN:ORF454] [OR : Staphylococcus 
sciuri] [DB:genpept-bctl] [DE:S.sciuri mecAl gene, strain K3(MM2).] [LE:4208] [RE:5572] 
[DI :direct] 



290 



NT AA 

ORFName NTID AAID ^—^—^ score probability 



[AI750300038E>_24648377_c3_2261 | [ 1049 | [ 4821 [ | 1146 | | 381 | |I5in~|| l . Oe-132 



Description 

gp: [GI :el429613 :g4756156] [LN:A67169] [AC:A67169] [ PN : NART GENE] [FN : NITRATE TRANSPORT] 
[OR: Staphylococcus carnosus] [DB : genpept -pat] [DE: Sequence 9 from Patent EP0805205.] 
[LE:538] [RE:1704] [DI: direct] >gp : [GI :g2529402] [LN : SCU40014] [AC:U40014] [PN:nitrate 
transporter] [GN:narT] [OR : Staphylococcus carnosus] [SR: Staphylococcus carnosus 
strain=TM300] [DB : genpept -bet 2 ] [DE : Staphylococcus carnosus nitrate transporter (narT) 
gene, completecds . ] [NT : NarT] [LE:90] [RE: 1256] [DI: direct] 

NT AA 

ORF Name NTID AAID — — ^ score probability 
LengthLength c ^~ 



A:£7S03000&aB_2464SS02_c3_20fla | [1050 | [4822 |p8~] | 155 | | 228 || S.ie-15 



Description 

sp : [LN : YHGC_BACSU] [AC:P38049] [GN : YHGC] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 18.8 
KD PROTEIN IN ECSC-PBPF INTERGENIC REGION] [SP:P38049] [DB : swissprot] >pir : [LN : B40614] 
[AC:B40614 :F69832 ] [PN : conserved hypothetical protein yhgC hypothetical protein X 
(pbpF 5' region)] [GN:yhgC ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : g304160] 
[LN : BACPBPF] [AC:L10630] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain W168) 
DNA] [DB:genpept-bctl] [DE:Bacillus subtilis penicillin-binding protein (pbpF) gene, 5' 
end.] [NT: product unknown] [LE:247] [RE:747] [DI : complement] >gp : [GI : ell83012 :g2633346] 
[LN:BSUB0006] [AC : Z99109 : AL009126] [GN:yhgC] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE: Bacillus subtilis complete genome (section 6 of 21) : from 999501 
tol209940.] [NT: alternate gene name: yixC; similar to hypothetical] [SP:P38049] 
[LE:83202] [RE:83702] [DI : complement] >gp : [GI :e3 25006 :g2 226228] [LN: BSY14083] 
[AC:Y14083] [PN : Hypothetical protein] [GN:yixC] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis chromosomal DNA, region 76-78 degrees: 
betweenglyB-aprE.] [NT: See Swiss Prot P38049; YIXC_BACSU] [SP:P38049] [LE:3061] 
[RE: 3561] [DI complement] 

NT AA 

ORF Name NTID AAID — — score probability 



LengthLength - 

&:H6030009aS_2464flSbl_t2J79i | [1051 | [4823 |p77^ — | p | pre - || S . 6e-06 



Description 

gp: [GI:gl022725] [LN: SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept -bet 1] 
[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds.] [NT : ORF2 ] 
[LE:394] [RE:1083] [DI : complement ] >gp : [GI :g295162] [LN : STAMECRA] [AC:L14017] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus (strain COL) DNA] 
[DB:genpept-bctl] [DE : Staphylococcus aureus methicillin-resistance protein (mecR) 
geneand unknown ORF, complete cds . ] [NT:unknown ORF1; putative] [LE:1492] [RE: 2181] 
[DI : complement] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|Ai7S030009Bb_246S23I2_±i_419 ] [1052 | [4824 |pTT| — 1(1X37 || 2.4e-115 



Description 

gp: [GI:g4835822] [LN : AF102174 ] [AC : AF102174] [PN:glycine betaine transporter BetL] 
[GN:betL] [OR:Listeria monocytogenes] [DB : genpept -bct2] [DE:Listeria monocytogenes 

glycine betaine transporter BetL (betL)gene, complete cds.] [LE:209] [RE: 1732] 
[DI:direct] 



291 



NT AA 

ORF Name NTID AAID Le ^ t ^ Le ^ t ^ SCQre probability 



AI750300098b_2466b9b7_c3_2149 | | 10b3 | [4825 | | 1494 | [497 [ | 1335 | | 2 . 5e- THT 



Description 

sp: [LN:XYLB_BACSU] [AC:P39211] [GNrXYLB] [OR : BACILLUS SUBTILIS] [EC : 2 . 7 . 1 . 17] 
[DE : XYLULOSE KINASE, (XYLULOKINASE) ] [SP:P39211] [DB : swissprot ] >pir : [LN : D69735] 
[AC:D69735 ] [PN:xylulose kinase xylB] [GN : xylB ] [CL : xylulokinase] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : gl750125] [LN : BSU66480] [AC:U66480] [PN: xylulose kinase] 
[GN:xylB] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis SpoVK (spoVK) , 
YnbA (ynbA), YnbB (ynbB), GlnR(glnR), glutamine synthetase (glnA) , YnaA (ynaA) , YnaB 
(ynaB) , YnaC(ynaC), YnaD (ynaD) , YnaE (ynaE) , YnaF (ynaF) , YnaG (ynaG) # YnaH(ynaH), 
Ynal (ynal), YnaJ (ynaJ) , xylan beta-1 , 4 -xylosidase (xynB) , xylose repressor (xylR) , 
xylose isomerase (xylA) , xylulose kinase (xylB) , YncB (yncB) , YncC (yncC) , YncD (yncD) 
and YncE (yncE)genes, complete cds.] [LE:19399] [RE:20898] [DI:direct] 
>gp: [GI:ell83420:g2634145] [LN: BSUB0010] [AC : Z99113 : AL009126] [PNixylulose kinase] 
[GN: xylB] [FN:xylose metabolism] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 7 . 1 . 17] 
[DE:Bacillus subtilis complete genome (section 10 of 21): from 1781201to 2014980.] 
[NT:alternate gene name: yncA] [SP:P39211] [LE:111450] [RE:112949] [DI:direct] 

NT AA 

ORF Name NTID AAID — — , _ score probabil ity 
LengthLength L - 

AI750300038b_247202b7_ti_i3i | [1054 | |482g |p^~| |45 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



^I750300098b 2472217S cl Ib38 



LengthLength - 
| [4827 ||TTuT]p65 | p?T~]|2 . 9e-32 



Description 

pir: [LN:H70069] [AC:H70069 ] [PN:capsular polyglutamate biosynthesis homolog ywtB] 
[GN:ywtB ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84494 :g2636113] [LN: BSUB0019] 
[AC:Z99122 :AL009126] [GN:ywtB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT: similar to capsular polyglutamate biosynthesis] [LE: 99788] [RE .-100930] 
[DI: complement] >gp: [GI : e308090 : gl894767] [LN : BSZ92954] [AC:Z92954] [GN : ywtB] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis yws [A, B, C, D, E, F, G] and gerBC 

genes.] [NT:product highly similar to Bacillus anthracis CapA] [LE:1552] [RE: 2694] 
[DI:direct] >gp: [GI :ell844 94 :g2 636113] [LN: BSUB0019] [AC : Z99122 : AL009126] [GN:ywtB] 
[FN: unknown] [OR: Bacillus subtilis] [DB:genpept] [DE: Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT:similar to capsular polyglutamate 

biosynthesis] [LE: 99788] [RE: 100930] [DI : complement] 

NT AA 

ORF Name NTID AAID — — , _ score probabil ity 
LengthLength 



AI7S030009UbJM730438_c2_ia33 | [1056 | [ 4828 | pTT^ | 1 3 9 || 374 | | i.7e-34 



Description 

gp: [GI:g4959403] [LN: AF115391] [AC : AF1153 91] [PN:ribose permease RbsD] [GN: rbsD] 
[OR Lactobacillus sakei] [DB :genpept-bct2] [DE : Lactobacillus sakei LaaA (laaA) gene, 
partial cds; LaaB (laaB) , putative acetate kinase AckA (ackA) , LaaC (laaC) genes, 
completecds; rbs operon, complete sequence; and LaaE (laaE) gene, partialcds.] 
[LE:4035] [RE:4430] [DI: direct] 

NT AA 

ORF Name NTID AAID — — , , score prob ability 
LengthLength c 



AI7S03000aflS_24797i40_c2_i87i | [1057 | [ 4823 [ pTT~ 1 1299 || 14b | |7.3e-0fl 



Description 

sp: [LN:ESTE_PSEFL] [AC:P22862] [OR : PSEUDOMONAS FLUORESCENS] [EC:3.1.1.2] 
[DE : ARYLESTERASE , (ARYL-ESTER HYDROLASE)] [SP:P22862] [DB : swissprot ] 
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ORF Name 



NT AA 
NTID AAID „ — . T — i _ 1 score 
LengthLength 



probability 



IAI7503000985 24797900 ci 1361 



1058 | |4830 |pn~||70 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID _ — Ll _ 



1AI7503000985 24812502 ci 1521 



1059 I 



_ — _ — score probabilit y 
LengthLength c 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength probability 



17503000985 24814838 c2 1789 



~| [1050 | [4832 ||g7u~||289 ||350 ||6.1e-32 



Description 

pir: [LN:S76790] [AC:S76790 ] [PN : hypothetical protein] [CL : hypothetical protein bl725] 
[OR: Synechocystis sp . ] [SR:PCC 6803, , PCC 6803] [SR:PCC 6803, ] [DB:pir2] 
>gp: [GI:dl019435:gl653791] [LN:D90916] [AC : D90916 : ABO 013 3 9] [PN : hypothetical protein] 
[OR: Synechocystis sp . ] [SR: Synechocystis sp. (strain : PCC6803) DNA] [DB : genpept-bctl] 
[DE: Synechocystis sp . PCC6803 complete genome, 26/27, 3270710-3418851.] 
[NT : ORF ID:slrl563] [LE:99350] [RE:100237] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID _ — . , . 



LengthLength 



17503000985 24817202 cl 1512 



TUFT" 



[4833 |[TOT5~j pnr 



score probability 
799 ||2.3e-116 



Description 

pir: [LN:C69975] [AC:C69975 ] [PN : acyltransf erase homolog yrhL] [GN:yrhL ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :gl934616] [LN : BSU93874] [AC:U93874] [PN : hypothetical 
protein YrhL] [GN:yrhL] [OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis 
cysteine synthase (yrhA) , cystathioninegamma- lyase (yrhB) , YrhC (yrhC) , YrhD (yrhD) , 
formate dehydrogenase chain A (yrhE) , YrhF (yrhF) , formate dehydrogenase (yrhG) , 
YrhH(yrhH), regulatory protein (yrhl) , cytochrome P450 102 (yrhJ) , YrhK(yrhK) , 
hypothetical protein YrhL (yrhL), putative anti-SigV factor (yrhM) , RNA polymerase sigma 
factor SigV (sigV) and YrhO (yrhO) genes, complete cds, and YrhP (yrhP) gene, partial 
cds.] [NT: similar to Haemophilus influenzae hypothetical] [LE: 13904] [RE: 15808] 
[DI: complement] >gp: [GI :ell83 944 :g2 635160] [LN:BSUB0014] [AC : Z99117 : AL009126] [GN:yrhL] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 14 of 21): from 2599451to 2812870.] [NT:similar to acyltransf erase] 
[LE : 171138] [RE : 173 042] [DI : direct] 



ORF Name 

IAI7503000985 24855337 c2 1957 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



][ 



TuTT 



[4834 ||693~[[230 | [1003 ||3 . 8e-i0i 



Description 

gp: [GI:el429602 :g4756154] [LN:A67161] [AC:A67161] [FN:NARI GENE] [OR : Staphylococcus 
carnosus] [DB : genpept-pat] [DE: Sequence 1 from Patent EP08052 05.] [NT: unnamed protein 
product] [LE:9942] [RE:10625] [DI: direct] >gp : [GI :g3929525] [LN : AF02 9224] 
[AC:AF029224 :AF029225] [PN:NarI] [GN:narI] [OR : Staphylococcus carnosus] 

[DB:genpept-bct2] [DE : Staphylococcus carnosus nir and nar operons, complete sequences.] 
[NT:similar to Escherichia coli nitrate reductases NRA] [LE:12127] [RE:12810] 
[DI : direct] 
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ORF Name 



NT ID AAID m 



AI75 03 0 0098b_2 501'/8_C3_2162 



NT AA 
— « _ — , , score 
LengthLength - 



probability 



] [1063 | [48:35 |pT^ — 1 177 |p [10.0015 



Description 

pir: [LN:E70557] [AC:E70557 ] [PN : hypothetical protein Rvl615] [GN:Rvl615 ] 
[OR: Mycobacterium tuberculosis] [DB:pir2] >gp : [GI : e316881 :g2113899] [LN :MTCY01B2] 
[AC:Z95554 :AL123456] [PN : hypothetical protein Rvl615] [GN:Rvl615] [OR .-Mycobacterium 

tuberculosis] [DB : genpept-bctl] [DE : Mycobacterium tuberculosis H3 7Rv complete genome; 

segment 72/162.] [NT:Rvl615, (MTCY01B2 . 07) , len: 146. Function: unknown] [LE:7451] 
[RE:7891] [DI :direct] 



ORF Name 



IAI7503000985 2507950 ti 19$ 



Description 
NO-HIT 



ORF Name 



^17503000985 251 tl 433 



Description 
NO-HIT 



_ __ _ _ NT AA 

NT ID AAID — ^ T — , , score 

LengthLength 



106 5 



probability 



1064 | [4836 |P^~| J7T 



1TmTn NT AA 

NT ID AAID — , — . , score 

LengthLength 



probability 



|4837 j|T53 | |5TT 



ORF Name 



IA17503000985 25398426 tl 211 



Description 
NO-HIT 

ORF Name 



IAI7503000985 25413577 t3 1032 



Description 
NO-HIT 

ORF Name 



1AI7503000 985 25429700 c2 1683 



_ ___ _ _^ „^ NT AA 

NTID AAID — _^ — , , score 
LengthLength 



probability 



"j |1066 | |4838 [ 



[T^5 I 



55" 



NTID 



_ _ -w-^->^ NT AA 

AAID — — , , score 

LengthLength 



probability 



][ 



|4839 ||129 | 



NTID 



NT 



AA 



AAID . , 



LengthLength 



score 



probability 



] [1068 | [4840 |p5F]|451 ||158 ||i.8e-08 



Description 

gp: [GI:e332306:g2462111] [LN : BCY11138 ] [AC:Y11138] [GN : ORF1] [OR:Bacillus cereus] 
[DB: genpept-bctl] [DE:B. cereus DNA for ORF1, ORF2 and ORF3 (2402 bp).] [NT:shows weak 
homology to C. elegans cosmid C33A12 ORF] [LE:156] [RE:1373] [DI:direct] 



ORF Name 



NTID 



NT 

AAID _ — 



AA 



LengthLength 



score 



IAI7503000985 25431558 ti 63 



] [1069 | [4841 | |7Trr-|| 234 [Im- 



probability 
1.9e-85 



Description 

gp: [GI:gl575026] [LN : SAU52 961] [AC:U52961] [PN : LrgB] [GN : lrgB] [OR : Staphylococcus 
aureus] [SR : Staphylococcus aureus strain=NCTC 8325-4] [DB : genpept-bct2] 

[DE: Staphylococcus aureus holin-like protein LrgA (lrgA) and LrgB (lrgB) genes, complete 
cds.] [NT :LytSR- regulated gene; similar to E.coli yohK] [LE:805] [RE: 1506] [DI: direct] 
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ORF Name 



NT AA 
NTID AAID — _^ — , , scor e 
LengthLength 



probability 



IA17503000985 254J.4452 ti Si 



j [1070 | [ 4842 | |179r ||596 | p^T]| l . le-192 



1 



Description 

gp: [GI:g862312] [LN : STALYTS] [AC:L42945] [GN:lytS] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE : Staphylococcus aureus lytS and lytR genes, complete cds . ] [LE:92] 
[RE: 1846] [Dlidirect] 



ORF Name 



117503000985 255702S2 ti 358 



NTID 



^ m NT AA 

AAID — . — score 

LengthLength 



probability 



TU7T 



144 1 1*7 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



[AI7503000985 25578215 c2 1740 



] [1072 | [4844 |pH~ ||180 |p3~| [ 



probability 
4.9e-46 



Description 

gp: [GI:e316582 :g2791909] [LN : SSK11MECA] [AC:Y13094] [GN:ORF141] [OR : Staphylococcus 
sciuri] [DB:genpept-bctl] [DErS.sciuri mecA gene, strain Kll (792).] [LE:4489] 
[RE:4914] [DI:direct] 



ORF Name 



U7503000985 2557962 17 698 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



[1073 | [4845 ||TT7T|| 458 | p5T~]| 7 . 7e-9& 



Description 

sp : [LN : NAOX_ENTFA] [AC:P37061] [GN:NOX] [OR : ENTEROCOCCUS FAECALIS] [SR :, STREPTOCOCCUS 
FAECALIS] [EC:1.6.99.3] [DE:NADH OXIDASE, (NOXASE) ] [SP:P37061] [DB : swissprot] 
>pir: [LN:S26965] [AC:S26965 ] [ PN : NADH oxidase] [CL:NADH peroxidase] [OR : Enterococcus 
faecalis] [DB:pir2] >gp : [GI :g47045] [ LN : S FNOXAA] [AC :X68847 : S45681] [PN: NADH oxidase] 
[GN:nox] [OR : Enterococcus faecalis] [DB : genpep t - be 1 1 ] [DE : S . faecalis nox gene for NADH 
oxidase.] [SP:P37061] [LE:88] [RE:1428] [Dlidirect] 



ORF Name 



NTID AAID 



p7503000985_25580425_ii_74 

Description 

NO-HIT 



NT AA 
LengthLength 
] [1074 | [4846 |p3~||44 | 



— v score probability 



ORF Name 



NTID AAID 



^17503000985 25585932 cl 1598 



[TUTT 



4847 



NT AA 
LengthLength 
T7F 



score probability 
9.0e-22 



Description 

gp: [GI:dl039105:g4514322] [LN : AB013367] [AC : AB013367] [PN:YdeI] [GN:ydeI] 
halodurans] [SR:Bacillus halodurans (strain : C- 125) DNA] [DB :genpept-bctl] 
halodurans C-125 ydel gene, complete cds.] [LE:276] [RE:869] [DI:direct] 



[OR: Bacillus 
[DE: Bacillus 
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NT AA 

QRF Name »™ MID Le — thLe — th gcore probability 



A1750300098bJ>b60001b_c3_2090 | [ 1076 | [ 4848 | [TOT] [543 | p5T]| 2 . 9e-150 



Description 

pir : [LN:E69989] [ACrE69989 ] [PN : acetate- -CoA ligase, ytcl] [GNrytcI ] [CL : acetate- -CoA 
ligase homology] [ORrBacillus subtilis] [EC:6.2.1.1] [DB:pir2] 

>gp: [GI :ell85829 :g2635440] [LN : BSUB0016] [AC : Z99119 : AL009126] [GNrytcI] [FN:unknown] 
[ORrBacillus subtilis] [DB rgenpept-bctl] [DE:Bacillus subtilis complete genome (section 
16 of 21): from 2997771to 3213410.] [NTrsimilar to acetate-CoA ligase] [LE:24979] 
[RE:26574] [DI : complement] >gp : [GI : g2293232] [LN : AF008220] [AC : AF008220] [PNrYtcI] 
[GNrytcI] [ORrBacillus subtilis] [DB rgenpept-bct2] [DErBacillus subtilis rrnB-dnaB 
genomic region.] [NTrputative acetate CoA-ligase] [LEr 153853] [RE r 155448] [DIr direct] 



NT AA 

QRF Name NT ID AAID — — , score probability 
LengthLength c JL 

A17503000985_2562687b_t2_J>53 | [1077 | [4849 | [rT7~| [ 48 | 



Description 
NO-HIT 



NT AA 

QRF Name NTID AAID — — score probability 



IAI7503000985 25665427 El 1335 



TUTS" 



LengthLength - 
[4850 |pTJ5~j[101 [[84 [[0.0061 



Description 

sp: [LNrSSUA_BACSU] [AC:P40400] [GNrSSUA] [ OR r B AC I LLUS SUBTILIS] [DE r PUTATIVE ALIPHATIC 
SULFONATES BINDING PROTEIN PRECURSOR] [SP:P40400] [DB r swissprot] >pir : [LN : 13 9927] 
[ACrI39927rC69817 ] [PN r ABC transporter (binding lipoprotein) homolog ygbA] [GNrygbA ] 
[ORrBacillus subtilis] [DBrpir2] >gp r [GIrg438472] [LNr BACORFKLM] [AC:L16808] 
[ORrBacillus subtilis] [SRrBacillus subtilis (individual_isolate MS11) (libraryr Tn91] 
[DBrgenpept-bctl] [DErBacillus subtilis orfK, orfL and orfM, complete cds 1 s . ] 
[NT r Likely N- terminal signal sequence, followed by] [LE r 69] [RErl067] [DIrdirect] 
>gpr [GIrell82873 rg2633207] [LNr BSUB0005] [AC r Z99108 r AL009126 ] [GNrygbA] [FNrunknown] 
[ORrBacillus subtilis] [DBrgenpept-bctl] [DErBacillus subtilis complete genome (section 
5 of 21) r from 802821 tol011250.] [NT r alternate gene name r yzeA; similar to ABC] 
[SPrP40400] [LErl58868] [RErl59866] [DIrdirect] >gp r [GI re3 08630 :gl 903039] [LN r BSZ93102] 
[ACrZ93102] [PN r hypothetical 36.3 kd lipoprotein precursor] [GNryzeA] [ORrBacillus 
subtilis] [DBrgenpept-bctl] [DE : B . subtilis yga [L, M, N, O, P, Q, R, S , T] , yzdB and yze[A,C} 
genes.] [NT r homology to nitrate transport protein precursor] [SPrP40400] [LEr 1194] 
[REr2192] [DIrdirect] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



|A1 7 b03000985 25961087 rl 123 



[1079 | [4851 [pT~]|270 [[399 |[3.9e-37 



Description 

sp r [LNr YFIE__BACSU] [ACrP54721] [GNrYFIE] [OR r BACILLUS SUBTILIS] [DE r HYPOTHETICAL 31.5 
KD PROTEIN IN GLVBC 3 1 REGION] [SPrP54721] [DB r swissprot] >pirr [LNrH69802] [ACrH69802 ] 
[PNrconserved hypothetical protein yfiE] [GNryfiE ] [ORrBacillus subtilis] [DBrpir2] 
>gpr [GIrell82814 rg2633148] [LNr BSUB0005] [AC r Z99108 r AL009126] [GNryfiE] [FNrunknown] 
[ORrBacillus subtilis] [DBrgenpept-bctl] [DErBacillus subtilis complete genome (section 
5 of 21) r from 802821 tol011250.] [NTrsimilar to hypothetical proteins from B. 
subtilis] [SPrP54721] [LEr94696] [REr95553] [DIrdirect] >gp r [GI r dl009744 r gl486247] 
[LNrD50543] [ACrD50543] [PNrunknown] [GNryfiE] [FNrunknown] [ORrBacillus subtilis] 
[SRrBacillus subtilis (strainrl68, haplotype : haploid) DNA] [DBrgenpept-bctl] 
[DErBacillus subtilis DNA for 76-degree region, complete cds.] [LEr 8372] [RE r 9229] 
[DIrdirect] 
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NT AA 

ORF Name NTID AAID Le ^ t hLength score probability 

|A17S0300098b_2b9709b2J:i_315 | [1080 | [4852 ||141 [ |45 | 

Description 
NO-HIT 



ORF Name 



117503000985 26017278 ci 1441 



Description 
NO-HIT 



NT 



AA 



NTID AAID Le ~ t - hLe ~ th score probability 



[1081 | [4853 |[T32— 1[43 | 



ORF Name 



NT 

NTID AAID . — 



AA 



LengthLength 



score 



|AI7S03000985J*6i79777_cl_lSSfl | [1082 | [4854 |p^TF] [605 | [ 



WIT 



probability 
3.6e-50 



Description 

pir: [LN:S75742] [AC:S75742 ] [PN : hypothetical protein sll0556] [OR : Synechocystis sp.] 
[SRiPCC 6803, , PCC 6803] [SR:PCC 6803, ] [DB:pir2] >gp : [GI : dl011128 : gl001236] 
[LN : SYCSLLE] [AC : D64003 : ABO 013 3 9] [PN : hypothetical protein] [OR : Synechocystis sp.] 
[SR: Synechocystis sp. (strain : PCC6803 ) DNA] [DB : genpept-bctl] [DE : Synechocystis sp. 
PCC6803 complete genome, 22/27, 2755703-2868766.] [NT : ORF_ID : sll0556] [LE:45438] 
[RE: 47333] [DI : complement] 



ORF Name 



NTID 



AAID 



IAI7503000985 26188837 cl 1601 



NT AA 
LengthLength 



score 



ID 



probability 
1.2e-83 



Description 

pir: [LN:A69670] [AC:A69670 ] [PN:choline ABC transporter (choline -binding protein) 
opuBC] [GN:opuBC ] [OR : Bacillus subtilis] [DB:pir2] >gp : [GI : ell86059 :g2635884] 
[LN:BSUB0018] [AC : Z99121 : AL009126] [PN:choline ABC transporter (choline -binding] 
[GN:opuBC] [FN:high affinity transport of choline] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DErBacillus subtilis complete genome (section 18 of 21) : from 
3399551to 3609060.] [NT : alternate gene name: proX] [LE:60016] [RE:60936] 
[DI: complement] >gp : [GI :g2293449] [LN: AF008930] [AC : AF008930] [PN:choline binding 
protein precursor] [GN:opuBC] [OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus 
subtilis choline transport system including ATPase (opuBA) , transmembrane protein 
(opuBB) , choline binding proteinprecursor (opuBC) and transmembrane protein (opuBD) 
genes, completecds; and unknown gene.] [NT:part of choline uptake system; OpuBC; 
lipoprotein] [LE:2708] [RE: 3628] [DI: direct] 



ORF Name 



NTID 



NT AA 

AAID — — score 
LengthLength 



probability 



1&I7S03000&8S 26207537 13 1201 



Description 
NO-HIT 



[4856 1F^~I 



ORF Name 



|M750300098b_262i3885_c3_2215 



Description 
NO-HIT 



NT 



AA 



NTID AAID — — , , score probabilit y 
LengthLength c *- 



[4857 ||T?2 | |5T" 
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NT AA 

QRF Name *™ MID Le — thLe — th score probability 



AI750300098bJ^20077_i:2_445 | [1085 | [ 4858 | | 44 7 | |148 || 259 | | 2.7e-22 



Description 

pir: [LN:E69857] [AC:E69857 ] [PN : conserved hypothetical protein ykmA] [GN:ykmA ] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell81515 :g2632035] [LN: BSAJ2571] [AC : AJ002571] 
[PN : YkmA] [GNrykmA] [FNrunknown] [ORrBacillus subtilis] [DB : genpept -bet 1 ] [DErBacillus 

subtilis 168 56 kb DNA fragment between xlyA and ykoR.] [LE : 33559] [RE: 34002] 
[DI: complement] >gp: [GI : el 183335 :g2 633669] [LN: BSUB0007] [AC : Z99110 : AL009126] [GN:ykmA] 
[FN:unknown] [ORrBacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete 

genome (section 7 of 21): from 1194391to 1411140.] [NT : similar to hypothetical 

proteins] [LE : 186508] [RE: 186951] [DI : complement] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th scor e probability 

|A17b0300098b_263b341i_ci_1391 | [1087 | [ 4853 ||57g~| | 191 | 

Description 
NO-HIT 

NT AA 

QRF Name NTID AAID — — , , score probability 
LengthLength c 

AI750300098b_2638026B_c3_2042 | [1088 | [ 4860 | |1263 1 1420 | p91 | |i . 2e-131 



Description 

gp: [GI:el299582:g3687416] [LN : BLY17554] [AC:Y17554] [PN:arginine deiminase] [GN : arcA] 
[ORrBacillus lichenif ormis] [DB : genpept -bet 1] [ECr3.5.3.6] [DErBacillus lichenif ormis 
arcA, arcB, arcC and arcD genes.] [LEr248] [RE: 1489] [DI:direct] 

NT AA 

QRF Name NTID AAID — _ — , , score probability 
LengthLength c L ~ 



IAI7503000985 26383512 ci 1506 1 [TuTTS" 



[4861 |pW]|465 ||ii9S ||i-7e-i21 



Description 

sp : [LN : YDGF_BACSU] [AC:P96704] [GN: YDGF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
TRANSPORT PROTEIN IN EXPZ-DINB INTERGENIC REGION] [SP:P96704] [DB : swissprot] 
>pir: [LN:H69782] [AC:H69782 ] [PN: amino acid ABC transporter (permease) homolog ydgF] 
[GN:ydgF ] [CL:arginine permease] [OR:Bacillus subtilis] [DB:pir2] 

>gpr [GIrdl020148:gl881368] [LN : AB001488 ] [AC : AB001488] [GN:ydgF] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain: 168) DNA] [DB : genpept -bet 1] [DErBacillus subtilis genome 
sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] [NT: PROBABLE AMINO 
ACID TRANSPORT PERMIASE . ] [LE: 139917] [RE: 141293] [DI : complement] 

>gp: [GI:ell82528 :g2632862] [LNrBSUB0003] [AC r Z99106 r AL009126] [GNrydgF] [FNrunknown] 
[ORrBacillus subtilis] [DB : genpept -bet 1] [DErBacillus subtilis complete genome (section 
3 of 21) r from 402751 to611850.] [NTrsimilar to amino acid ABC transporter (permease)] 
[SPrP96704] [LEr203663] [REr205039] [DI r complement] >gp r [GI r ell82 54 lrg2 632875] 
[LNrBSUB0004] [AC : Z99107 r AL009126] [GNrydgF] [FNrunknown] [ORrBacillus subtilis] 
[DBrgenpept-bctl] [DErBacillus subtilis complete genome (section 4 of 21) : from 600701 
to813890.] [NTrsimilar to amino acid ABC transporter (permease)] [SPrP96704] [LEr5713] 
[REr 7089] [DI r complement ] 
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ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



E 



] [1090 I [4862 |pTT|[478 | p?T]|l . 5e-i47 



] 



Description 

pir: [LN:F69811] [AC:F69811 ] [PN : 2 -oxoglutarate/malate translocator homolog yflS] 
[GN:yflS ] [CL: 2 -oxoglutarate/malate translocator] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI :ell82747 :g2633081] [LN : BSUB0005] [AC : Z99108 : AL009126] [GN:yflS] [FN:unknown] 
[ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
5 of 21): from 802821 tol011250.] [NT:similar to 2 -oxoglutarate/malate translocator] 
[LE:26070] [RE:27506] [DI:direct] >gp: [GI : dl023175 : g2443241] [LN:D86417] [AC:D86417] 
[PN:YflS] [ORrBacillus subtilis] [SR:Bacillus subtilis (strain :AC327) DNA] 
[DB :genpept-bctl] [DE:Bacillus subtilis 35.7 kb genomic DNA, 70-73 degree 
region, complete cds . ] [LE: 21015] [RE: 22451] [DI : complement] 



ORF Name 



NTID 



17503000985 26429800 ±2 605 



Description 
NO-HIT 



AAID 



NT AA 
LengthLength 
[4863 ||126~||4i | 



score probability 



ORF Name 



NT 



AA 



NTID AAID — _ „ — , , score probabil ity 
LengthLength 



E 



17503000985 26449187 cl 1436 



Description 
NO-HIT 



] [1092 | [4864 |p^~ ||42 | 



ORF Name 



NTID 



™™ NT AA 

AAID — — _ score 

LengthLength 



^7503000985 26571937 F2 570 



1093 | [4865 |[7^~||262 |p3~ | 



probability 
i.2e-83 



Description 

pir: [LN:D69845] [AC:D69845 ] [PNrthiamin biosynthesis homolog yjbT] [GN:yjbT ] 
[CL: thiamine biosynthesis protein thiG] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell83189:g2633523] [LN : BSUB0007] [AC : Z99110 : AL009126] [GN:yjbT] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NT:similar to thiamin biosynthesis] [LE:50122] 
[RE : 50892] [DI :direct] 



ORF Name 



NTID 



NT 



AA 



AAID . — , — 



LengthLength 



score 



[A17503000985 26595641 t2 572 



|4866 | |206 



probability 
|6.1e-39 



Description 

sp : [ LN : APL_LACLA] [AC:Q48630] [GN: APL] [OR : LACTOCOCCUS LACTIS] 

[SR: , SUBSPLACTIS : STREPTOCOCCUS LACTIS] [DE: ALKALINE PHOSPHATASE LIKE PROTEIN] 
[SP:Q48630] [DB : swissprot] >pir : [LN: S39339] [AC:S39339 ] [PN: alkaline phosphatase- like 

protein] [CL:probable alkaline phosphatase yngC] [OR : Lactococcus lactis] [DB:pir2] 
>gp: [GI:g435296] [LN : LLALPHLP] [AC:Z29065] [PN:alkaline phosphatase like protein] 
[GN:apl] [OR: Lactococcus lactis] [DB : genpept-bctl] [DE:L. lactis (MG1363) apl gene for 

alkaline phosphatase like protein.] [SP:Q48630] [LE:339] [RE:1067] [DI:direct] 
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ORF Name 



NT 



AA 



^ ^ L e^thLe^th ^^ probability 



|A17b0300098b 26600425 t3 1300 



T0U1T 



[4867 |[792~ ||263 | pU~||3 . 6e-18 



Description 

sp: [LNrARYlJVIOUSE] [AC:P50294] [GN: AAC1 :NAT1] [OR:MUS MUSCULUS] [SR:, MOUSE] 
[EC:2.3.1.5] [DE : ARYLAMINE N- ACETYL/TRANSFERASE 1, (NAT-1) ] [SP:P50294] [DB : swissprot] 
>gp: [GI:gl045651] [LN : MMNAT1] [AC:U37119] [PN : arylamine N-acetyltransf erase] [GN :NAT1] 
[OR:Mus musculus] [SR:house mouse strain=C3H/HeJ] [DB : genpept-rod] [DE:Mus musculus 
arylamine N-acetyltransf erase (NAT1) gene, completecds . ] [LE:1] [RE:873] [DI:direct] 
>gp: [GI :gl008568] [LN : MMU35885] [AC:U35885] [PN : N-acetyltransf erase NAT-1] [OR:Mus 
musculus] [SR:house mouse strain=C57B16] [DB : genpept-rod] [DE:Mus musculus 
N-acetyltransf erase NAT-1 mRNA, complete cds.] [LE:217] [RE: 1089] [DI: direct] 



ORF Name 



UL7503000985 26605001 c3 2206 



NTID 



NT 

AAID — 



AA 



LengthLength 



score 



][ 



1096 



|4868 [ |274 



probability 
||i.5e-67 



Description 

sp: [LN:YXBGJ3ACSU] [AC:P46331] [GN : YXBG : E3BR] [OR : BACILLUS SUBTILIS] [EC:1. -.-.-] 
[DE: (EC 1. -.-.-)] [SP:P46331] [DB : swissprot] >pir : [LN: B70073] [AC:B70073 ] [PN:glucose 
1- dehydrogenase homolog yxbG] [GN : yxbG ] [CL : short- chain alcohol dehydrogenase 
homology] [OR:Bacillus subtilis] [DB:pir2] >gp: [GI : dl022451 :g904199] [LN: AB005554] 
[AC:AB005554 :D45242 :D31629] [PN:probable oxidoreductase] [GN:yxbG] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain:BGSC 1A1) DNA] [DB : genpept-bctl] [EC : 1 . 1 . 1 . 47] 
[DE: Bacillus subtilis genomic DNA, 36 kb region between gnt and ioloperons . ] 
[NT: conserved universally] [LE: 26446] [RE: 27234] [DI : complement] 

>gp: [GI:ell84709:g2636530] [LN : BSUB0021] [AC : Z99124 : AL009126] [GN : yxbG] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
21 of 21): from 3999281to 4214814.] [NT : alternate gene name: yxaU; similar to glucose] 
[SP:P46331] [LE:91359] [RE:92147] [DI: direct] 



ORF Name 



NTID AAID — , — , score 
LengthLength 



IA1 7 503000985 26672jf>12 cl 1554 



] [1097 | [4869 ||T52~|[63 I m- 



probability 
0.048 



Description 

gp: [GI:g2444136] [LN:U88974] [AC:U88974] [PN:ORF56] [OR : Streptococcus thermophilus 
temperate bacteriophage O1205] [DB : genpept-phg] [DE : Streptococcus thermophilus 
temperate bacteriophage O1205, completegenome . ] [LE: 42158] [RE: 42484] [DI: direct] 



ORF Name 



NTID 



„ Tn NT AA 

AAID — — score 

LengthLength 



ptf7503000985_266877_t2_877 



1098 | |4870 ||450 [ [155 



probability 
|4.7e-25 



Description 

pir: [LN:F69653] [AC:F69653 ] [PN: transcription regulator lrpC] [GN:lrpC ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : el 182391 :g2 632725] [LN: BSUB0003] [AC : 299106 :AL009126 ] 
[PN: transcriptional regulator (Lrp/AsnC family)] [GN:lrpC] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 3 of 21) : from 402751 
to611850.] [NT:alternate gene name: ydal] [LE:72862] [RE:73296] [DI:direct] 
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4 



NT AA 

ORF Name NTID AAID Le ~ t ft Le ^ t h scor e probability 



A1750300098b_267bi43i_c3_19 , n [ |1099 | [4871 ||10g2 1 1353 1 11037 ||9 . 5e-105 | 



Description 

pir: [LN:E69581] [AC:E69581 ] [PN:acetoin dehydrogenase El component (TPP-dependent beta 
subunit) acoB] [GN:acoB ] [CLipyruvate dehydrogenase (lipoamide) beta chain] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82797 :g2633131] [LN : BSUB0005] 
[AC:Z99108 :AL009126] [PN:acetoin dehydrogenase El component] [GN : acoB] [ORrBacillus 
subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 5 of 21): 
from 802821 tol011250.] [NT : alternate gene name: yfjJ] [LE:76693] [RE:77721] 
[DI: direct] >gp: [GI : dl025207 :g2780394] [LN:D78509] [AC:D78509] [PN: Yf j J] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain :AC327) DNA] [DB : genpept-bctl] [DE:Bacillus 
subtilis YfjG-YfjR genes, complete cds.] [LE:8604] [RE:9632] [D I : complement] 
>gp: [GI :g2245638] [LN : AF006075] [AC : AF006 075] [PN : TPP-dependent acetoin dehydrogenase, 
El] [GN : acoB] [OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis acetoin 
dehydrogenase enzyme system gene cluster , ribosomal protein L6-like protein gene, 
partial cds, TPP-dependentacetoin dehydrogenase, El alpha-subunit (acoA) , 
TPP-dependentacetoin dehydrogenase, El beta-subunit (acoB) , 

dihydrolipoamideacetyl transferase (acoC) and dihydrolipoamide dehydrogenase 

(acoL) genes, complete cds, and regulatory protein (acoR) gene, partialcds.] [NT: beta 

subunit of the El component of the acetoin] [LE:1830] [RE: 2858] [DI:direct] 

>gp: [GI:g2245638] [LN : AF006075] [AC : AF006075] [PN : TPP-dependent acetoin dehydrogenase, 

El] [GN: acoB] [OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis acetoin 

dehydrogenase enzyme system gene cluster , ribosomal protein L6-like protein gene, 

partial cds, TPP-dependentacetoin dehydrogenase, El alpha-subunit (acoA) , 

TPP-dependentacetoin dehydrogenase, El beta-subunit (acoB) , 

dihydrolipoamideacetyltransf erase (acoC) and dihydrolipoamide dehydrogenase 
(acoL) genes, complete cds, and regulatory protein (acoR) gene, partialcds.] [NT: beta 
subunit of the El component of the acetoin] [LE:1830] [RE: 2858] [DI: direct] 

1 ""ii— 1« ■■■■iiiiiimnniiiLii.miiiijii»uLim»ui.i.ii .1 . ■ m ■ ...,.„.„, . ..M.M.— 1 ■J— ■» n. .. i.i.n— , —- — — ^„ , ■ ■ .. , . , ,,. , „ ,,» , — . ... .«., 11, „ . „ .., „ , f 

NT AA 

ORF Name NTID AAID — — ^ score probability 
LengthLength c *- 



AI7S03000985_2g751887_cl_U08 | [1100 | [4872 |31 7 | p~ 



|2.5e-25 



Description 

sp: [LN:APBA_AQUAE] [AC:067619] [GN : APBA : AQ_1 727] [OR:AQUIFEX AEOLICUS] [EC : 1 . 1 . 1 . 169] 
[DE REDUCTASE) (KPA REDUCTASE) ] [SP:067619] [DB : swissprot] >pir : [LN: A70449] [AC:A70449 
] [PN: hypothetical protein aq_1727] [GN:aq_1727 ] [OR:Aguifex aeolicus] [DB:pir2] 
>gp: [GI:g2984043] [LN: AE000753] [AC : AE000753 : AE000657] [PN:putative protein] 
[GN:aq_1727] [OR:Aquifex aeolicus] [DB :genpept-bct2] [DE:Aquifex aeolicus section 85 of 
109 of the complete genome.] [LE:5968] [RE: 6900] [D I : complement] 



ORF Name 


NT AA 

NTID AAID . — . , , — . , score probability 
LengthLength ^ L - 


AI750300098^_273452_c2_18I5 


| |1101 |4873 ||1278||425 ||800 ||1.2e-79 



Description 

sp : [LN : HMDH_ARCFU] [AC: 02 8 53 8] [GN : HMGA : AF1 736] [OR : ARCHAEOGLOBUS FULGIDUS] 
[EC:1.1.1.34] [DE : REDUCTASE) ] [SP:028538] [DB : swissprot] >pir : [LN:G69466] [AC:G69466 ] 
[PN: 3 -hydroxy- 3 -methylglutaryl- coenzyme A reductase (mvaA) homolog] [OR: Archaeoglobus 
fulgidus] [DB:pir2] >gp: [GI :g2648815] [LN: AE000983] [AC : AE000983 : AE000782] 
[PN: 3 -hydroxy- 3 -methylglutaryl -coenzyme A reductase] [GN:AF1736] [OR : Archaeoglobus 
fulgidus] [DB:genpept-bct2] [DE : Archaeoglobus fulgidus section 124 of 172 of the 
complete genome.] [NT:similar to SP:P13702 GB:M29727 GB:M31807 PID:151259] [LE:7093] 
[RE : 8403 ] [DI : complement] 



301 



ORF Name 



|AI 7 50300098t>_27ib007_c3_1983 
Description 



NTID^ 



AAID 



NT 



AA 



Length Length 



score probability 



[1102 | V [3T74 ||138 ||45 | 



NO-HIT 



ORF Name 



NTID AAID _ . , , 



\I7503000985 2739050 C2 1816 



TTTFT 



NT AA 
LengthLength 



score 



probability 
1.2e-56 



Description 

sp: [LN:YWBI_BACSU] [AC:P39592] [GN: YWBI : IPA-24D] [OR:BACILLUS SUBTILIS] 

[DE: HYPOTHETICAL TRANSCRIPTIONAL REGULATOR IN THIK-EPR INTERGENIC REGION] [SP:P39592] 
[DB:swissprot] >pir : [LN : S39679] [AC : S39679 :G70051 ] [PN : transcription regulator homolog 
ywbI:protein ipa-24d] [GNrywbl ] [CL:probable transcription regulator lsyR] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g413948] [LN : BSGENR] [AC:X73124] [GN:ipa-24d] 
[ORrBacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] 
[SP:P39592] [LE:24460] [RE:25365] [DI:direct] >gp : [GI :ell86330:g2 636366] [LN: BSUB0020] 
[AC:Z99123 :AL009126] [GNrywbl] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 20 of 21): from 3798401to 4010550.] 
[NT:alternate gene name: ipa-24d; similar to] [SP:P39592] [LE:132594] [RE:133499] 
[DI : complement] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000985 2739561 tl 99 



[4875 |prau-||479 | |lb05 ||2 . 5e-154 



] 



Description 

sp: [LN:GLPT_BACSU] [AC:P37948] [GN: GLPT] [OR:BACILLUS SUBTILIS] [DE : PERMEASE) ] 
[SP:P37948] [DB : swissprot] >pir : [LN : 140417] [AC : 140417 : F69634 : S3 7250 ] 
[PN: glycerol -3 -phosphate transport protein glpT : glycerol -3 -phosphate permease glpT] 
[GN: glpT ] [CL:hexose phosphate transport protein uhpT] [OR .-Bacillus subtilis] 
[DB:pir2] >gp : [GI : dl034077 : g3599636 ] [LN : AB006424] [AC : AB006424] [GN:ybeE] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain:168) DNA] [DB : genpept-bctl] [DErBacillus 
subtilis genomic DNA, 70 kb region between 17 and 23degree.] [LE : 36984] [RE: 38318] 
[DI: complement] >gp : [GI :g403372] [LN: BSGLPTQ] [AC:Z26522] [PN:glycerol 3-phosphate 
permease] [GN:glpT] [FN: uptake of glycerol 3-phosphate] [OR .-Bacillus subtilis] 
[DB: genpept-bctl] [DE :B . subtilis glpT and glpQ genes for glycerol 3-phosphate 
permeaseand glycerophosphoryl diester phosphodiesterase.] [SP:P37948] [LE:315] 
[RE:1649] [DI: direct] >gp : [GI :ell82166:g2 632500] [LN : BSUB0002] [AC : Z99105 : AL009126] 
[PN: glycerol -3 -phosphate permease] [GN : glpT] [FN:uptake of glycerol-3 -phosphate] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
2 of 21): from 194651 to415810.] [NT : alternate gene name: ybeE] [SP:P37948] [LE:39333] 
[RE:40667] [DI : complement] 



ORF Name 



NTID 



NT AA 
AAID — — , score 
Length Length 



probability 



E 



17503000985 2750930 tl 121 



Description 
NO-HIT 



[1105 j [4877 [Jigs— |p j 



ORF Name 



AI7503000985_2767577_c3_2139 

Description 

NO-HIT 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



[1106 l [4878 |p^~| |44 j 



302 



ORF Name 



NTID 



NT 



AA 



AAID . — . , _ 



LengthLength 



score 



IAI7503000985 2790936 £2 823 



j [110 7 | |4879 |pT~[pi7 |pT- 



probability 
3.1e-83 



Description 

sp: [LN:MANA_STRMU] [AC:Q59935] [GN:PMI] [OR : STREPTOCOCCUS MUTANS] [EC:5.3.1.8] 
[DE: (PMI) (PHOSPHOHEXOMUTASE) ] [SP:Q59935] [DB : swissprot] >gp : [GI :dl004537 :g451216] 
[LN : STRPMI] [AC:D16594] [PN:Mannosephosphate Isomerase] [GN:pmi] [OR : Streptococcus 
mutans] [SR : Streptococcus mutans (strain : GS-5) DNA] [DB : genpept-bctl] [EC:5.3.1.8] 
[DE:S. mutans pmi gene for mannosephosphate isomerase (complete cds) andscrK gene for 
fructokinase (partial cds).] [LE:241] [RE:1191] [DI:direct] 



ORF Name 



IAI7S03000S85 2814000 c3 2159 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



Description 
NO-HIT 



[4880 |[TF6 | pT 



ORF Name 



NT 



AA 



NTID AAID — . T — , , score probabi lity 
LengtnLengtn 



IA17503000985 2819803 c3 1966 



j [1109 | [4881 |p63~||120 | pre - |[i . ie-16 



Description 

gp: [GI:el312399:g3341642] [LN: VCH231123] [AC : AJ231123 ] [GN:z61r] [OR:Vibrio cholerae] 
[DB:genpept-bctl] [DE:Vibrio cholerae z61r gene.] [LE:47] [RE:388] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000985 2823562 c3 2100 



] [1110 | [4882 |pST]|454 ||909 ||3.5e-91 



Description 

pir: [LN:A71026] [AC:A71026 ] [PNrprobable aminotransferase] [GN:PH1501 ] [OR : Pyrococcus 
horikoshii] [DB:pir2] >gp: [GI :dl 031552 :g3 257926] [LN : AP000006] 

[AC:AP000006 :AB005215 : AB009510 : AB009511 : AB009512 :AB009513 :AB009514] [PN:43 8aa long 
hypothetical aminotransferase] [GN:PH1501] [OR : Pyrococcus horikoshii] [SR : Pyrococcus 
horikoshii (strain:OT3) DNA, clone : Pyrococcus horikoshi] [DB : genpept-bctl] 

[DE : Pyrococcus horikoshii OT3 genomic DNA, 1166001-1485000 nt . position (6/7) . ] 

[NT: similar to owl:D5045389 percent identity: 38.287 in] [LE: 170905] [RE: 172221] 

[DI:direct] 



ORF Name 



NT AA 
NTID AAID — _ T — , _ score 
LengthLength 



probability 



IAI7503000985 2853431 12 504 



J [1111 j [4883 ||195~-||64 | 



Description 
NO-HIT 



ORF Name 



VI7503000985 2854787 cl 1553 



Description 
NO-HIT 



NTID 
j [1112 | [4884 



NT AA 
AAID — . — . . score 
LengthLength 



probability 



TT 



303 



ORF Name 



NT ID 



NT 



AA 



AAID . — . „ — 



LengthLength 



score 



|AI750300098b 2W75"ST c2 1945 



1113 | [498b |pi~[p20 |pP-| f 



probability 
|2.7e-38 



Description 

pir : [LN:A69756] [AC:A69756 ] [PN:adhesion protein homolog ycdH] [GNrycdH ] [CL:adhesin 

B] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : dl023108 :g2415736] [LN: AB000617] 
[AC:AB000617] [PN: YcdH] [GNrycdH] [ORiBacillus subtilis] [SR:Bacillus subtilis 
(strain:168 trpC2) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis genomic DNA, 22 to 25 

degree region, completecds . ] [NT : homologue of adhesion protein precursor of] [LE: 21421] 
[RE:22380] [DIrdirect] >gp: [GI :ell82237:g2 632571] [LN:BSUB0002] [AC : Z99105 : AL009126] 
[GNrycdH] [FNrunknovm] [ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis 

complete genome (section 2 of 21): from 194651 to415810.] [NTrsimilar to adhesion 

protein] [LE: 113236] [REr 114195] [DIrdirect] 



ORF Name 



NT AA 
NTI D AAID — . T — , , score 
LengthLength 



probability 



1^17503000986 29304552 c3 2150 



[1114 | [4885 IP^ - ! 



Description 
NO-HIT 



ORF Name 



NTID 



[AI7503000985 29352312 c3 2063 



][ 



_ _ _ _ NT AA 

AAID — 

LengthLength 

[4887 |[7^5~ | [T5T 



score probability 
1617 ||3.ie-60 



Description 

Sp : [LN : YXDL_BACSU] [AC:P42423] [GN : YXDL : B65F] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
ABC TRANSPORTER ATP -BINDING PROTEIN IN IDH 3 ' REGION] [SP:P42423] [DB : swissprot] 
>pir: [LN:A70074] [AC:A70074 ] [PN : ABC transporter (ATP-binding protein) homolog yxdL] 
[GN:yxdL ] [CL : ATP-binding cassette homology] [OR:Bacillus subtilis] [DB:pir2] 
>gpr [GI rdl003812 rg709993] [LN r BACIOLO] [AC:D14399] [PN : hypothetical protein] [GN : B65F] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:BGSC 1A1 (168 trpC2)) DNA] 
[DB:genpept-bctl] [DE: Bacillus subtilis 15 kb chromosome segment contains the iol 
operon.] [NT : homologous to cell division protein FtsE of E.] [LE:13014] [RE:13787] 
[DIrdirect] >gpr [GI :ell84 689 :g2 636510] [LN : BSUB002 1] [AC : Z99124 : AL009126] [GN:yxdL] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis complete 
genome (section 21 of 21): from 3999281to 4214814.] [NTrsimilar to ABC transporter 
(ATP-binding protein)] [SPrP42423] [LEr 69902] [REr 70675] [DI r complement] 
>gpr [GIrdl008911rgl408484] [LNrD45912] [AC:D45912] [GN: yxdL] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain:BGSC 1A1 (Marburg 168; trpC2) ) DNA] [DB :genpept-bctl] 
[DE:Bacillus subtilis genome sequence between the iol and hut operon, partial and 
complete cds.] [NT homologous to FtsE protein of E. coli, belonging to] [LE:801] 
[RE: 1574] [DIrdirect] 



ORF Name 



NT 



AA 



NTID AAID — « — , , score probabi lity 
LengthLength L - 



AI7S03000985_29375307_r2_499 

Description 

NO-HIT 



"j |iil5 | [4888 |[I?5~||44 ~[ 



304 



NT AA 

Name NTID AAID — — score probability 



Length Length ' 

A175030009Bb_2a400JJ2_cl_li>71 | [ 1117 | [ 4889 |pT7"^| i38 | [Tpy- fll . 3e-27 | 



Description 

pir: [LN:C69770] [AC:C69770 ] [PN : hypothetical protein ydaT] [GNrydaT ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI : dl020028 : gl881248] [LN: AB001488] [AC : AB001488] [GNrydaT] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis genome sequence, 148 kb sequence of the regionbetween 35 and 47 
degree.] [NT: FUNCTION UNKNOWN . ] [LE: 26046] [RE: 26498] [D I : complement] 

>gp: [GI:ell82404:g2632738] [LN : BSUB0003 ] [AC : Z99106 : AL009126] [GNrydaT] [FNrunknown] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
3 of 21): from 402751 to611850.] [LE:89790] [RE:90242] [DI : complement] 



NT AA 

ORF Name NTID AAID Le ^ t fr Le ^ th score probability 

M7503000985_29493627_t2_872 | [1118 | [4890 | pTS3 — 1 154 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength - 

|AI7S0300098SJ39B03403J:2J7B8 | [1119 | [4891 p" 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ — , , score probability 



IAI7503000985 29532827 t2 477 



TT2TT 



LengthLength 
|4892 | pT 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

AI750300098S_29695327_c2_i739 | [ 1121 | [ 4893 |[7Fu~|| 249 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ , — , , score probability 
LengthLength c i - 

|&I7S03000S86_299B6003_c2jL894 | |ii22 | |4894 ||i29 |p — 

Description 
NO-HIT 



305 



ORF Name 



NT AA 

NTID AAID «^ _ 1 , score 

Length Length 



probability 



117503000985 30078^7 c'l 1943 



] [1123 | [ 4895 |[7?g— 1|26:> | p5~ | |b . 8e-59 



Description 

pir: [LN:E69761] [AC : E69761 : 140450 : S52381 ] [PN:probable glutamine ABC transporter] 
[GN:yckK ] [CL:lysine-arginine-ornithine-binding protein] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : ell82313 : g2632647] [LN :BSUB0002] [AC : 299105 : AL009126] [GN : yckK] 
[FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete 

genome (section 2 of 21) : from 194651 to415810.] [NT: similar to glutamine ABC 

transporter] [LE:215587] [RE:216393] [DI : complement] >gp: [GI :ell82328:g2 632662] 

[LN:BSUB0003] [AC : Z99106 : AL009126 ] [GN : yckK] [FN:unknown] [OR:Bacillus subtilis] 

[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 3 of 21) : from 402751 

to611850.] [NT: similar to glutamine ABC transporter] [LE:7487] [RE: 8293] 
[DI: complement] >gp : [GI :dl009629 :gl805432] [LN:D50453] [AC:D50453] [PN : homologue of 

glutamine -binding periplasmic] [GN: yckK] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain:168 trpC2) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA for 25-36 degree 

region containing theamyE-srfA region, complete cds . ] [LE: 89890] [RE: 90696] 
[DI : complement] 



ORF Name 



IAI7503000985 30078378 ci 1437 



Description 
NO-HIT 



NTID 



AAID 



NT 



AA 



Length Length 



score 



probability 



] | 1124 1 |489S |[T^— | [54 | 



ORF Name 



AI7503000985 3009382 C3 1965 



Description 
NO-HIT 



ORF Name 



[AI7503000985 30095011 tl 179 



Description 
NO-HIT 



NT AA 
NTID AAID — . w — , - score 
Length Length 



probability 



] [1125 | [4897 |pi~ [|175 | 



NTID 



NT AA 

AAID 

LengthLength 

05" 



score 



probability 



ORF Name 



NTID 



F 



„ TT ^ NT AA 

AAID — — score 

LengthLength 



probability 



1750300098b 3013356i> cl 1399 



] [1127 | [4899 |pT^T]|395 ||306 ||2 . 8e^T7 



Description 

sp : [LN : YYBF_BACSU] [AC:P37498] [GN : YYBF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 44.2 
KD PROTEIN IN COTF-TETB INTERGENIC REGION] [SP:P37498] [DB : swissprot] >pir : [LN: S65991] 
[AC:S65991:A70087 ] [PN:membrane protein yybF] [GN:yybF ] [CL:probable antibiotic 
resistance protein yybF] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :dl005739 :g467351] 
[LN:BAC180K] [AC:D26185] [PN: unknown] [OR: Bacillus subtilis] [SR: Bacillus subtilis 
(sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 
kilobase region of replication origin.] [LE:27159] [RE:28373] [DI : complement] 
>gp: [GI:ell84792 :g2636613] [LN : BSUB0021] [AC : Z99124 : AL009126] [GN:yybF] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
21 of 21): from 3999281to 4214814.] [NT:similar to antibiotic resistance protein] 
[SP:P37498] [LE:179091] [RE:180305] [DI : complement] 



ORF Name 



AI750300098b_302_t2jS45 

Description 

NO-HIT 



NT AA 
NTID AAID — _ — _ score 
LengthLength 

|1128 | [4900 ||i62 1 153 | 



probability 



306 



NT AA 

ORF Name NTID AAID Le ~ t hLeng th score probability 



[Al7b0300098! 3 _303fe092b_t2_901 | [1129 | [ 4901 || 642 — | pi3 [[ 204 |[2.0e-16 

Description 

sp: [LN:Y3 57_HAEIN] [AC:P44658] [GN:HI0357] [OR : HAEMOPHILUS INFLUENZAE] [DE : PUTATIVE 
THIAMINE BIOSYNTHESIS PROTEIN HI0357] [SP.-P44658] [DB : swissprot] >pir : [LN: C64063] 
[AC:C64063 ] [PN : hypothetical protein HI0357] [OR : Haemophilus influenzae] [DB:pir2] 
>gp: [GI:gl573325] [LN:U32720] [AC :U32720 : L42023] [PN:thiamine biosynthesis protein, 
putative] [GN:HI0357] [OR : Haemophilus influenzae Rd] [DB : genpept-bct2] [DE : Haemophilus 
influenzae Rd section 35 of 163 of the complete genome.] [NT: similar to SP:P42883 
SP:P43534 SP:P47183] [LE:3006] [RE:3950] [DI: direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th SCQre probability 



IAI7503000985 303S77S7 ti 5$ 



TTTD" 



[4902 |p3TT[|510 ]^ TT 



5.9e-3i 



Description 

pir: [LN:D71235] [AC:D71235 ] [PN : hypothetical protein PH0142] [GN:PH0142 ] 
[OR : Pyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030154 :g3256528] [LN : AP000001] 
[AC:AP000001:AB009465 :AB009464 :AB009466 : AB00946 7 : AB009468 :AB009469] [PN:289aa long 

hypothetical protein] [GN:PH0142] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB :genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 1-287000 

nt. position (1/7).] [NT :motif =soybean trypsin inhibitor (Kunitz) protease] [LE:124718] 
[RE: 125587] [DI :direct] 

NT AA 

ORF Name NTID AAID — — , , score probability 
Length Length c *- 



|A!t7503u0098g_30470325__c3_2i8i | [1131 | [ 4903 | [I^r[| g50 | p^~|| 8 . le-38 

Description 

pir: [LN:A70027] [AC:A70027 ] [PN : conserved hypothetical protein yvaC] [GN:yvaC ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell86043 : g2635868] [LN : BSUB0018] 
[AC:Z99121:AL009126] [GN:yvaC] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 18 of 21): from 3399551to 3609060.] 
[NT: similar to hypothetical proteins] [LE: 45749] [RE: 47644] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ — , , score probability 
LengthLength *- 

|AI7503000995_30651577_ca_2155 | [1132 | [ 4904 | p^~ [| 203 | p 1| 0. 00087 — [ 

Description 

pir: [LN:G70065] [AC:G70065 ] [PN : hypothetical protein ywpE] [GN:ywpE ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell84540:g2 636159] [LN:BSUB0019] [AC : Z99122 : AL009126] 
[GN:ywpE] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 19 of 21): from 3597091to 3809700.] [LE:143698] [RE:144006] 
[DI: complement] >gp: [GI :e289144 :gl763706] [LN: BSZ83337] [AC:Z83337] [GN:ywpE] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis mbl , flh[0,P], rapD, 
ywp[B f C f D f E f P f G f H f I, J] and ywqAgenes . ] [LE:5315] [RE:5623] [DI:direct] 

>gp: [GI:ell84540:g2636159] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN:ywpE] [FN:unknown] 
[OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis complete genome (section 19 
of 21): from 3597091to 3809700.] [LE:143698] [RE:144006] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ — , , score probabilit y 
LengthLength ^ L - 



AI750300098b_31490687_t3_lQ96 | [1133 | [4905 ||141 — 1 1 46 | 



Description 
NO-HIT 



307 



ORF Name 



F 



17503000985 31517587 c2 1663 



Description 
NO-HIT 



ORF Name 



IAI7503000985 3157062 cl 1475 



Description 
NO-HIT 



NT AA 

NTID AAID Le - thLe ~ th score probability 



] | 1134 | | 4906 |p^~|| 42 | 



NTID 



AA 



NT 

AAID — - — score probability 
LengthLength u 



J [1135 | [4907 ||174 [ [57 | 



ORF Name 



1750300098b c2 1716 



NTID 

] EEE 



„ TT ^ NT AA 

AAID — , — , . score 

LengthLength 



[4908 



699 I fZ5T 



probability 
7 . le-08 



Description 

pir : [LN: JH0364] [AC:JH0364 ] [PN : hypothetical protein 176 (SAGP 5' region)] 
[OR: Streptococcus pyogenes] [DB:pir2] 



ORF Name 



NT AA 
NTID AAID — _ „ — , . score 
LengthLength 



probability 



IAI7503000985 31720942 12 813 



Description 
NO-HIT 



] | 1137 1 |4909 |pF"^| 191 [ 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



50300098b 31755012 ci 134Q 



] | 1138 | [4910 ||978~||325 [[886 ||9.6e-89 



Description 

pir: [LN:E70070] [AC:E70070 ] [PN : metabolite transport protein homolog ywtG] [GN : ywtG ] 
[CL:glucose transport protein] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell84489:g2636109] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN : ywtG] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [NT:similar to metabolite transport protein] 
[LE:94500] [RE:95873] [DI : complement ] >gp: [GI : e308095 :gl894771] [LN : BSZ92954] 
[AC:Z92954] [GN:ywtG] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis 
yws[A,B,C,D,E,F,G] and gerBC genes.] [NT:product highly similar to metabolite 
transport] [LE:6609] [RE:7982] [DI:direct] >gp : [GI : ell84489 :g2636109] [LN : BSUB0019] 
[AC:Z99122 :AL009126] [GN:ywtG] [FN:unknown] [ORrBacillus subtilis] [DBrgenpept] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT: similar to metabolite transport protein] [LE: 94500] [RE: 95873] [DI : complement] 



ORF Name 



NTID 



™™ NT AA 

AAID — — score 

LengthLength 



probability 



IAI7503000985 3182927 c2 1673 



] | m9 | | 4a11 IP^p^ ||3598||0.0 



Description 

gp: [GI:g2981225] [LN : AF053 006] [AC : AF053 006] [PN:lipase precursor] [GN:gehl] 
[OR staphylococcus epidermidis] [DB : genpept-bct2] [DE : Staphylococcus epidermidis lipase 
precursor (gehl) gene, completecds . ] [LE:148] [RE: 2214] [DI: direct] 



ORF Name 

p7503000985_32205143_c2_i642 

Description 

NO-HIT 



_ _i_ __ NT AA 

NTID AAID — — LL score 

LengthLength 

] [1140 | [4912 [[168 | [55 | 



probability 



308 



ORF Name 



NT 



AA 



NTID AAID Le - thLe - th £gore probability 



[A1750300098b_32^0^02J:2_708 | [ 1141 | [4913 |P^~] [40 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



IAI7503000985 32221012 ci 1567 



TTTT 



[4914 ||47i~||i55 |[ITT 



score probability 
4.6e-06 



Description 

sp: [LN:NPT4_HUMAN] [AC:O00476] [GN : SLC17A4 : NPT4] [OR : HOMO SAPIENS] [SR : , HUMAN] 
[DE : COTRANSPORTER 4) (NA(+)/PI COTRANS PORTER 4)] [SP:O00476] [DB : swissprot] 
>gp: [GI :g2062692] [LN:HSU90545] [AC:U90545] [PN: sodium phosphate transporter] [GN:NPT4] 
[OR:Homo sapiens] [SR:human] [DB : genpept -pri2 ] [DE:Human sodium phosphate transporter 
(NPT4) mRNA, complete cds . ] [LE:377] [RE:1582] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID . , _ — 



LAI7503000985 32614078 t2 539 



Description 
NO-HIT 



"] [1143 | [4915 



LengthLength 
1150 I 



score probability 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000985 32664093 12 568 



] [1144 | [4916 |[STr~||208 | p2~ ||6 . 7e-10 



Description 

pir : [LN:G70418] [AC:G70418 ] [PN:probable thiamine phosphate synthase 2] [GNithiEl ] 
[CL: probable thiamin-phosphate pyrophosphorylase : thiamin-phosphate pyrophosphorylase 

homology] [OR:Aquifex aeolicus] [DB:pir2] >gp : [GI :g2983767] [LN: AE000736] 
[AC:AE000736:AE000657] [PN: thiamine phosphate synthase] [GNrthiEl] [OR:Aquifex 

aeolicus] [DB : genpept -bct2] [DE:Aquifex aeolicus section 68 of 109 of the complete 

genome.] [LE: 15292] [RE: 15852] [DI : complement] 



ORF Name 



UC7503000985 33203385 c3 1997 



NT 



AA 



NTID AAID , . , . — 



LengthLength 



score 



][ 



[4917 



probability 
2.5e-97 



Description 

gp: [GI:g2978430] [LN : SEU43366] [AC:U43366] [PN:IcaR] [GN : icaR] [OR : Staphylococcus 
epidermidis] [DB : genpept -bct2] [DE : Staphylococcus epidermidis operon mediating 
intercellular adhesion : IcaR, IcaA, IcaD, IcaB, and IcaC genes, complete cds.] [LE:39] 
[RE: 596] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — — . . score 
LengthLength 



probability 



IAI7503000985 33209682 c2 1718 



[4918 ||123 | 



Description 
NO-HIT 
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ORF Name 



NT ID 



NT 

AAID _ 



AA 



Length Length 



score 



A1750300098b_33211092_tj_il23 | [1147 | [4919 ||471 | |156 | [7? 



probability 
uTTJTO 



Description 

sp: [LN:VL02_VARV] [AC:P33041] [GN: L2R :M2R] [OR: VARIOLA VIRUS] [DE : PROTEIN L2] 
[SP:P33041] [DB:Swissprot] >pir : [LN: S33088] [AC : S33 088 : H36844 ] [PN:L2R protein :M2R 

protein] [CL:vaccinia virus F3 protein] [OR:variola virus] [DB:pir2] >gp : [GI : g262433] 
[LN:S55844] [AC:S55844] [GN : L2R] [OR:Variola major virus] [SR:Variola major virus 

India-1967] [DB :genpept-vrl] [DE : G9R . . . H7R [variola major virus, India-1967, Genomic, 

19 genes, 18029 nt].] [NT : This sequence comes from Fig. 2.] [LE:1860] [RE:2123] 

[DI: direct] >gp: [GI :g438992] [LN : VARCG] [AC:L22579] [OR:Variola major virus] 

[SR: Variola major virus (strain Bangladesh- 1975) DNA] [DB : genpept -vrl] [DE: Variola 

major virus (strain Bangladesh-1975) complete genome.] [NT:homolog of vaccinia virus 

CDS L2R; putative] [LE:69256] [RE:69519] [DIrdirect] >gp : [GI : g297254] [LN : WCGAA] 
[AC:X69198] [GN:M2R] [OR:Variola virus] [DB : genpept - vrl ] [DE:Variola virus DNA complete 

genome.] [SP:P33041] [LE:68623] [RE:68886] [DI:direct] >gp : [GI : g62350] [LN: WHINDQKH] 
[AC:X67119] [GN : L2R COP] [OR: Variola virus] [DB : genpept -vrl] [DE:Variola virus 
(Hindi II -Q, K, H, M, L, I , F genome fragment) genes.] [SP:P33041] [LE:18508] [RE:18771] 
[DI:direct] >gp : [GI : e92818 :gll43683] [LN : WL8R] [AC:X76267] [OR:Variola virus] 
[DB: genpept- vrl] [DE:Variola virus (Garcia-1966 ) 

L8R,I1L,I3R,I2L,I4L,I5R / I5.5R,I6R,I7L,I8R,I9R,N1R / N2R,N3L / N4R and N5R genes.] 
[NT : ORF13R] [SP:P33041] [LE:10592] [RE:10855] [DI:direct] 



ORF Name 



NTID 



wivtt ^ NT AA 

AAID — , . _ — , , score 

LengthLength 



probability 



|AI750300098b 33219007 t3 1152 



Description 
NO-HIT 

ORF Name 



] [1148 | [4920 |p7B~||124 | 



NT AA 
NTID AAID — — _ sco re 
LengthLength 



probability 



IAI7503000985 33241093 c2 1806 



[4921 |p3TT||518 | [1534 ||5 .2e-168 



Description 

pir: [LN:A69759] [AC:A69759 ] [PN : 1 -pyrroline- 5-carboxylate dehydrogenase homolog ycgN] 
[GN: ycgN ] [CL: aldehyde dehydrogenase (NAD+) : aldehyde dehydrogenase homology] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82273 : g2632607] [LN : BSUB0002] 
[AC:Z99105:AL009126] [GN: ycgN] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 2 of 21): from 194651 to415810.] 
[NT: similar to 1 -pyrroline- 5-carboxylate dehydrogenase] [LE: 150401] [RE: 151948] 
[DI:direct] >gp : [GI : dl009590 :gl805393] [LN:D50453] [AC:D50453] [PN:68% identity protein 
to] [GN:ycgN] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168 trpC2) DNA] 
[DB : genpept -bet 1] [DE:Bacillus subtilis DNA for 25-36 degree region containing 
theamyE-srfA region, complete cds . ] [LE:24702] [RE:26249] [DI:direct] 



ORF Name 



NTID 



^ m NT AA 

AAID — , — . , score 

LengthLength 



|A17b03000985 33242842 cl 1379 



1150 | [4922 ||?F7T]p55~ 



probability 
2.0e-24 



Description 

pir: [LN:S59797] [AC:S59797 ] [PN : hypothetical protein YDR332w: hypothetical protein 
D9798.1] [CL:unassigned DEAD/H box helicases :DEAD/H box helicase homology] 
[OR:Saccharomyces cerevisiae] [DB:pir2] [MP:4R ] 



ORF Name 



NTID 



_ _ xt^ NT AA 

AAID — _ — score 

LengthLength 



probability 



A17503000985_33250287_ci_i513 

Description 

NO-HIT 



[4923 ||T55 | p~ 
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ORF Name 



1AI7503000985 33304082 tl 387 



Description 
NO-HIT 



ORF Name 



IAI7503000985 33359381 c2 1597 



Description 
NO-HIT 



NT AA 

NTID AAID L en^thLen^th ^^ probability 



"| [ 1152 | [ 4924 | | 129 "-|| 42 | 



NT AA 

NTID AAID — _ „ — . score probability 
LengthLength c 



] 1 P 25 |pi0~]|69 | 



ORF Name 



NTID 



„ TT% NT AA 

AAID — , — , . score 

LengthLength 



F 



17503000985 33391337 c'2 1891 



][ 



ITT5T" 



|4925 |p54 | 



][ 



T51T 



probability 
2.1e-12 



Description 

gp: [GI:g4894301] [LN: AF065404] [AC : AF0654 04] [PN:pX01-85] [OR:Bacillus anthracis] 
[DB : genpept -bct2 ] [DE: Bacillus anthracis virulence plasmid PX01, complete sequence.] 
[LE:99636] [RE:100319] [DI: direct] 



ORF Name 



UC7503000985 33555087 c2 1545 



Description 
NO-HIT 



NTID 



TTBir 



NT AA 
AAID — — score 
LengthLength 



probability 



|4927 |[141 | 



ORF Name 



117503000985 33520176 c3 2091 



NTID 



7V7VTT ^ NT AA 

AAID — — , , score 

Length Length 



j [1156 | [4328 |[858 | 1 



probability 
3.3e-47 



Description 

gp: [GI:dl020251:gl943993] [LN: AB001896] [AC : AB001896] [GN:orf30] [OR : Staphylococcus 
aureus] [SR : Staphylococcus aureus (strain: 912) DNA] [DB : genpept -bctl] 

[DE: Staphylococcus aureus DNA for sigma70 operon, complete cds . ] [LE:1501] [RE: 2310] 

[DI : direct] 



ORF Name 



NT AA 
NTID AAID — . — _ score 
LengthLength 



probability 



IAI7503000985 33837817 cl 1431 



Description 

NO -HIT 

■ ■ — ..i— i— — • ■ ii.ii i 

ORF Name 



] | 1157 | |4929 |pg— 1[42 | 



NTID 



„ TO NT AA 

AAID — — _ score 

LengthLength 



probability 



IAI7503000985 33985007 t3 1255 



Description 
NO-HIT 



1158 | [4930 ||153 1 150 | 



ORF Name 



NTID 



™™ NT AA 

AAID — » — .score 

LengthLength 



IAI7503000985 33988778 r2 888 



][ 



[4931 |pu~||219 |p^~ 



probability 
3.0e-52 



Description 

pir: [LN:JC4511] [AC:JC4511 ] [PN : pyroglutamyl -peptidase I, :bacterial pyrrolidone 
carboxyl peptidase (PYRase) ] [GN:pcp ] [OR : Staphylococcus aureus] [EC : 3 . 4 . 19 . 3] 

[DB:pir2] >gp: [GI:g790573] [LN: SAU19770] [AC:U19770] [PN : pyrrolidone carboxyl 
peptidase] [GN:pcp] [OR : Staphylococcus aureus] [DB :genpept-bctl] [DE : Staphylococcus 
aureus pyrrolidone carboxyl peptidase (pep) gene , complete cds.] [NT:pyrase] [LE:204] 

[RE: 842] [DI : direct] 



311 



ORF Name 



NTID 



NT AA 
AAID — , — , . score 
LenqthLength 



probability 



|AI7S0300098bJM0U9JV_cl_1600 | [ 1150 | [ 4932 | pgy] . |450 | [rT73-| [i . le-115 



Description 

pir: [LN:C69670] [AC:C69670 ] [PN:glycine betaine/carnitine/choline ABC transporter 
(ATP-bindin) opuCA] [GN:opuCA ] [CL: glycine betaine/proline transport protein proV: 
ATP-binding cassette homology : CBS homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:g2271389] [LN : AF009352] [AC : AF009352] [PN: ATPase] [GNiopuCA] [ORiBacillus 
subtilis] [DB:genpept-bctl] [DE: Bacillus subtilis osmoprotectant transport system OpuC 
includingATPase (opuCA) , transmembrane protein (opuCB) , osmoprotectantbinding protein 
precursor (opuCC) and transmembrane protein (opuCD) genes, complete cds . ] [NT: OpuCA; 
part of the osmoprotectant transport system] [LE:860] [RE: 2002] [DI : direct] 
>gp: [GI:ell86071:g2635896] [LN:BSUB0018] [AC : Z99121 : AL009126] [PN:glycine 
betaine/carnitine/choline ABC] [GN:opuCA] [FN: high affinity transport of glycine 
betaine,] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete 
genome (section 18 of 21): from 3399551to 3609060.] [NT : alternate gene name: yvbE] 
[LE : 69373] [RE : 70515] [DI : complement] 



ORF Name 



AI7503000985 34021912 c2 1736 



Description 
NO-HIT 



NT AA 
NTID AAID — — , , score 
LengthLength 

1161 | [4933 |[123 1 140 | 



probability 



ORF Name 



NT AA 
NTID AAID — — _ score 
LengthLength 



probability 



] [1162 | [4934 [[TTST] |586 | pTF]|2 . 5e-134 



Description 

pir: [LN:G69769] [AC:G69769 ] [PN:pyruvate oxidase homolog ydaP] [GN:ydaP ] 
[CLracetolactate synthase large chain : thiamine pyrophosphate -binding domain homology] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI :dl020024 :gl881244] [LN: AB001488] [AC : AB001488] 
[GN:ydaP] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] 
[DB: genpept-bctl] [DE: Bacillus subtilis genome sequence, 148 kb sequence of the 
regionbetween 35 and 47 degree.] [NT: SIMILAR TO PYRUVATE OXIDASE AND ACETOLACTATE] 
[LE:21889] [RE:23613] [DI:direct] >gp : [GI : ell82400 :g2632734] [LN : BSUB0003 ] 
[AC:Z99106 :AL009126] [GN:ydaP] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 3 of 21): from 402751 to611850.] 
[NT:similar to pyruvate oxidase] [LE:85633] [RE:87357] [DI:direct] 



ORF Name 



NTID 



NT AA 
AAID — — _ score 
LengthLength 



probability 



IAI7B0300058b 34033563 cl 1627 



] [1163 | [4935 |pT7~||108 | p«8 ||S.7e-36 



Description 

gp: [GI : el429590 :g4756150] [LN:A67161] [AC:A67161] [FN: NIRD GENE] [OR : Staphylococcus 
carnosus] [DB :genpept-pat] [DE: Sequence 1 from Patent EP0805205.] [NT:unnamed protein 
product] [LE:2589] [RE:2903] [DI: direct] >gp : [GI : g4433640] [LN : AF02 9224] 
[AC:AF029224:AF029225] [PN :NirD] [GN:nirD] [OR : Staphylococcus carnosus] 

[DB:genpept-bct2] [DE : Staphylococcus carnosus nir and nar operons, complete sequences.] 
[NT: similar to Escherichia coli NADH- dependent nitrite] [LE:4774] [RE: 5088] [DI:direct] 



ORF Name 



AI7503000985_34094i36J:i_408 

Description 

NO-HIT 



NT AA 
NTID AAID — — , , score 
LengthLength 

[1164 | [4936 — } |5i | 



probability 
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ORF Name 



NT 



AA 



IAI7503000985 341V/U07 ±2 815 



Description 
NO-HIT 



AMD Le - thLe - th scpre probability 
[1155 | [4937 ||147~||48 | 



ORF Name 



NT ID AAID 



NT 



AA 



Length Length 



score 



^17503000985 34194002 C2 1941 



1165 I [4935 |[7TT | pTS" 



probability 
,8e-64 



Description 

sp : [LN : PMGY_ZYMMO] [AC:P30798] [GN : PGM] [OR : ZYMOMONAS MOBILIS] [EC:5.4.2.1] 
[DE: (BPG -DEPENDENT PGAM) ] [SP:P30798] [DB : swissprot] >pir : [LN: C40649] [AC:C40649 ] 
[PN:phosphoglycerate mutase,] [CL :phosphoglycerate mutase :phosphoglycerate mutase 

homology] [OR : Zymomonas mobilis] [EC:5.4.2.1] [DB:pir2] >gp : [GI :gl55611] [LN : ZMOPGMA] 
[AC:L09651] [PN :phosphoglyceromutase] [GN:pgm] [OR : Zymomonas mobilis] [SR : Zymomonas 

mobilis (strain CP4) DNA] [DB : genpept-bctl] [DE : Zymomonas mobilis phosphoglyceromutase 
(pgm) gene, complete cds,and 2-hydroxyacid dehydrogenase homologue (ddh) gene, 5' end.] 
[LE:317] [RE:1003] [DI: direct] 



ORF Name 



NTID 



„ Tn NT AA 

AAID — — _ score 

LengthLength 



probability 



IAI7503000985 34197318 t3 1128 



TTST" 



[4939 ||903 | pOO ||TTrr|p.2e-113 



Description 

pir: [LN:A49943] [AC : A4 9943 : S333 58 ] [PN : f ructose-bisphosphate aldolase,] 
[OR: Staphylococcus carnosus] [EC : 4 . 1 . 2 . 13] [DB:pir2] >gp : [GI :g297874] [LN : SCFDA] 
[AC:X7172 9] [PN : f ructose-bisphosphate aldolase] [GN:fda] [OR : Staphylococcus carnosus] 
[DB:genpept-bctl] [EC : 4 . 1 . 2 . 13] [DE : S . carnosus fdagene.] [SP:Q07159] [LE:667] 
[RE: 1557] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID , 



LengthLength 



IAI7503000985 34273436 t2 498 



] [1168 | [4940 |p3~||284 |[T7? 



score probability 
|2.8e-34 



Description 

sp: [LN:YQJG_BACSU] [AC:P54544] [GN : YQ JG] [OR:BACILLUS SUBTILIS] [DE : PRECURSOR] 
[SP:P54544] [DB : swissprot ] >pir: [LN:G69963] [AC:G69963 ] [PN : lipoprotein SpoIIIJ-like 
homolog yqjG] [GN:yqjG ] [CL: stage III sporulation protein : stage III sporulation 
protein homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl013293 :gl303958] 
[LN:BACJH642] [AC :D84432 :D82370] [PN : YqjG] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 234919] [RE : 235746] [DI : complement] 
>gp: [GI:ell85657:g2634823] [LN : BSUB0013 ] [AC : Z99116 : AL009126 ] [GN:yqjG] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
13 of 21): from 2395261to 2613730.] [NT:similar to lipoprotein SpoIII J-like] 
[SP:P54544] [LE:87873] [RE:88700] [DI:direct] 



ORF Name 



NTID AAID _ L , _ — 



VI750300099S 34385012 t2 465 



it 



1169 I WPTL 



NT AA 
LengthLength 
TTZZ 



score 



TUT 



probability 
6 . Oe-40 



Description 

pir: [LN:S62194] [AC:S62194 ] [PN : hypothetical protein 4] [CL : dipeptide transport 
protein] [OR : Methanosarcina barkeri] [DB:pir2] >gp : [GI : e212291 :gll24957] [LN :MBFMDSUBS] 

[AC:X93084] [GN:orf4] [OR : Methanosarcina barkeri] [DB : genpept-bctl] [DE :M. barkeri fmdE, 
fmdF, fmdA, fmdC, fmdD, fmdB, orf4, orf3, orf 2 , andorfl genes.] [LE:<1] [RE: 1588] 

[DI : complement] 



313 



ORF Name 

IA17S03000985_34410B43_c2J.590 



NT 



AA 



*™ ^2 Le^thLeiith^ore probability 



Description 
NO-HIT 



] [1170 | [4942 ||143q|476 | 



ORF Name 



NT 

NT ID AAID _ 



AA 



LengthLength 



UE7S03000985 34412952 c3 2210 



TT7T~ 



[4943 ||783 | pTO" 



score probability 
580 ||2.6e-56 



Description 

sp: [LN:YBBM_ECOLI] [AC:P77307] [GN : YBBM] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 28.2 KD 
PROTEIN IN USHA-TESA INTERGENIC REGION] [SP:P77307] [DB : swissprot] 



ORF Name 



NT ID 



NT 



AA 



AAID . . , , 



LengthLength 



score 



IAI7503000985 34414192 c3 2229 



][ 



TTTT 



[4944 |[g3T — | p76 ||H5~| [ 



probability 
0.00012 



Description 

pir: [LN:F69104] [AC:F69104 ] [PN: 3 1 , 5 1 -cyclic-nucleotide phosphodiesterase, cpdA 
homolog MTH178:Icc related protein] [GN: MTH178 ] [CL : 3 • , 5 1 -cyclic-nucleotide 
phosphodiesterase cpdA: 3 1 , 5 1 -cyclic-nucleotide phosphodiesterase cpdA 
homology : phosphoesterase core homology] [OR : Methanobac terium thermoautotrophicum] 

[EC:3.1.4.17] [DBrpirl] >gp : [GI : g2621221] [LN : AE000805] [AC : AE000805 : AE000666] [PN:Icc 
related protein] [GN :MTH178] [OR :Methanobacterium thermoautotrophicum] 

[DB:genpept-bctl] [DE : Methanobac terium thermoautotrophicum from bases 114371 to 

125512 (section 11 of 148) of the complete genome.] [NT:Function Code:10.02 - Metabolism 

of Macromolecules, ] [LE : 10465] [RE: 11055] [DI:direct] 



ORF Name 



NT AA 
NT ID AAID — _ T — score 
LengthLength 



probability 



IAI7503000985 34430428 c2 1949 



] | 1173 | I 4945 IP 75 "!! 124 |p ||0.005i 



Description 

pir: [LN:C69776] [AC.-C69776 ] [PN : hypothetical protein yddJ] [GN:yddJ ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl020089 : gl881309] [LN: AB001488] [AC : AB001488] [GN:yddJ] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis genome sequence, 148 kb sequence of the regionbetween 35 and 47 
degree.] [NT: FUNCTION UNKNOWN.] [LE: 78831] [RE: 79211] [DI:direct] 

>gp: [GI:ell82465:g2632799] [LN : BSUB0003] [AC : Z99106 : AL009126] [GN:yddJ] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
3 of 21): from 402751 to611850.] [LE:142576] [RE:142956] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _^ — j , score 
LengthLength 



probability 



AI7503000985 34554692 tl 62 



] [1174 | [4946 ||474~||157 ||403 ||i.5e-37 



Description 

gp: [GI:gl575025] [LN: SAU52961] [AC:U52961] [PN : holin-like protein LrgA] [GN : lrgA] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus strain=NCTC 8325-4] 
[DB : genpept-bct2] [DE : Staphylococcus aureus holin-like protein LrgA (lrgA) and 
LrgB(lrgB) genes, complete cds . ] [NT : LytSR- regulated gene; similar to E.coli yohJ] 
[LE:369] [RE:812] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID — _ „ — , score p robabil ity 
LengthLength *- 



IAI7503000985 34585317 c2 1938 



] | 117B 1 | 4947 |p^g3Tj | 496 |pu— || 2.6e-05 - 



Description 

gp: [GI :g454844] [LN : SCMP48EGG] [AC:M74170] [OR : Schistosoma mansoni] [SR : Schistosoma 
mansoni (strain NMRI) female adult worm DNA] [DB :genpept- invl] [DE : Schistosoma mansoni 
p48 eggshell protein gene, complete cds.] [NT : ORF 3] [LE:687] [RE:1868] [DI : complement] 



314 



ORF Name 



NTID 



score 



F 



m030009«5 3461V9J7 fi 122 7 



][ 



TTTT 



NT AA 

AAID „ 

LengthLength 

[ 4948 | | 525 | | 1 7 4 |p? 



probability 
|1.3e-17 



] 



Description 

gp: [GI:g2952545] [LN: AF051898] [AC : AF0518 98 ] [PN:coronin binding protein] [GN : DB10] 
[ORrDictyostelium discoideum] [DB : genpept- invl] [DE : Dictyostelium discoideum coronin 
binding protein (DB10) mRNA, complete cds . ] [LE:108] [RE: 1790] [DI : direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000985 34647150 ci 1439 



] [1177 | [4949 |p^T]|45i | |27r~||i . 6e-23 



Description 

gp: [GI:g2570198] [LN : LSU54556] [AC:U54556] [PN :microf ilarial sheath protein SHP3] 
[GN:shp3] [OR:Litomosoides sigmodontis] [DB : genpept -invl] [DE : Litomosoides sigmodontis 

microf ilarial sheath proteins SHP3a(shp3a) and SHP3 (shp3) genes, complete cds.] 
[NT structural protein; similar to shp3 genes from] [LE : 7991 : 8260] [RE : 8047 : 9219] 
[DI : direct Join] 



ORF Name 



NTID 



„ Tr4 NT AA 

AAID — _ — , , score 

LengthLength 



probability 



IAI7503000985 35193950 c2 1811 



j |1178 | |495Q [pre [ |162 | 



Description 
NO-HIT 



ORF Name 



NT 

NTID AAID 



AA 



LengthLength 



IAI 7 503000985 35317188 c2 1684 



TTTT 



[4951 Ip^Pq pTTT 



score probability 
12779 ||2.4e-289 



Description 

gp: [GI:g3929312] [LN : AF100426] [AC : AF100426] [PN : f imbriae-associated protein Fapl] 
[GN:fapl] [OR: Streptococcus parasanguinis] [DB : genpept -be t2] [DE : Streptococcus 
parasanguis f imbriae-associated protein Fapl (fapl) gene, complete cds.] [NT:invovled in 
fimbriae assembly and f imbriae-mediated] [LE:284] [RE: 7996] [DI: direct] 



ORF Name 



NT AA 

NTID AAID — _^ — , . score probabilit y 
Length Length 



[AI7503000985 35433438 t3 1030 



TT8TT 



1137 I (ITS 



2.3e-39 



Description 

sp: [LN:GLOX_BACSU] [AC:031616] [GN : YJBR] [OR:BACILLUS SUBTILIS] [EC:1.5.3.-] 
[DE : GLYCINE OXIDASE, ] [SP:031616] [DB : swissprot] >pir : [LN : B69845] [AC:B69845 ] 
[PN:sarcosine oxidase homolog yjbR] [GN:yjbR ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell83187:g2633521] [LN : BSUB0007] [AC : Z99110 : AL009126] [GN:yjbR] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept -bctl] [DE: Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NT:similar to sarcosine oxidase] [SP:031616] 
[LE:48816] [RE:49925] [DI: direct] 
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NT AA 

ORF Name NT ID AAID Le ^ t fr Le ^ t h score probability 



A17b03000985Jib44^3_c2_1668 | [1181 | [4953 ||831 | |276 ||103 ||0.0Qg9 



Description 

sp: [LN:YHI3_LACLA] [AC:Q02147] [OR : LACTOCOCCUS LACTIS] [SR :, SUBSPLACTIS : STREPTOCOCCUS 
LACTIS] [DE: HYPOTHETICAL 38.0 KD PROTEIN IN HISC-HISG INTERGENIC REGION (ORF3)] 
[SP:Q02147] [DB : swissprot] >pir: [LN:C45734] [AC:C45734 ] [PN : histidyl- tRNA synthetase 
homolog] [OR : Lactococcus lactis subsp. lactis] [DB:pir2] >gp : [GI :g2565140] 
[LN:LLU92974] [AC :U92 974 : M9076 0 :M90761] [PNrunknown] [OR : Lactococcus lactis] 
[DB:genpept-bctl] [DE : Lactococcus lactis unknown gene, partial cds, and HisC 
(hisC) , unknown, HisG (hisG) , unknown, HisB (hisB) , unknown, HisH (hish),HisA (hisA) , 
HisF (hisF) , HisIE (hisIE) , unknown, unknown, LeuA (leuA) , LeuB (leuB) , LeuC (leuC) , 
LeuD (leuD) , unknown, IlvD(ilvD), IlvB (ilvB) , IlvN, IlvC (ilvC) , IlvA (ilvA) , AldB 
(aldB)and aldR (aldR) genes, complete cds.] [NT : ORF2 ; potential regulator; similar to 
histidyl -tRNA] [LE:2146] [RE:3132] [DI:direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

|AI7S030005aS_3S687J:2_8i2 | [1182 | [4954 [p? - 1 153 | 

Description 
NO-HIT 



ORF Name 



NT AA 

NTID AAID , — _ , — , , score probability 
Length Length 



AI7503000985 35817137 t3 1254 



Description 
NO-HIT 



1183 | [4955 ||T^— || bl | 



ORF Name 



NT AA 

NTID AAID — _ T — . , score probabi lity 
LengthLength c £ ~ 



1AI7503000985 35993802 c2 1685 



] [1184 | |495£ |p33T]p7 | 



Description 
NO-HIT 

ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



AI7503000985 35995316 tl 393 



] [1185 | [4957 ||I25T||418 Jpi^O | [ 



probability 
9.6e-224 



Description 

pir: [LN:S77608] [AC:S77608 ] [PN:probable intercellular adhesion protein 
A:glycosyltransf erase : icaA protein] [GN : icaA ] [OR : Staphylococcus epidermidis] 
[DB:pir2] >gp: [GI:gll61380] [LN: SEU43366] [AC:U43366] [PNrlcaA] [GN : icaA] 
[OR: Staphylococcus epidermidis] [DB :genpept-bct2] [DE : Staphylococcus epidermidis operon 
mediating intercellular adhesion : I caR, IcaA, IcaD, IcaB, and IcaC genes, complete cds.] 
[LE:761] [RE:1999] [DI:direct] 
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NT AA 

0RF Name NTID AAID Le — thLe — th ggore probability 



|A17S0300098b_3^127302_c2_1875 I [1186 I [4958 [ [ 888 | |29b | pB8 j|B . 7e-107 ~| 

Description 

sp: [LN:GTAB_BACSU] [AC:Q05852] [GN : GTAB] [OR : BACILLUS SUBTILIS] [EC:2.7.7.9] 
[DE: (GENERAL STRESS PROTEIN 33) (GSP33 ) ] [SP:Q05852] [DB : swissprot] >pir : [LN : A40650] 
[AC:A40650 :B69638 ] [PN : UTP- -glucose- 1-phosphate uridylyltransf erase ,: UDP-glucose 
pyrophosphorylase] [GN:gtaB ] [CL : Escherichia coli UTP- -glucose- 1 -phosphate 
uridylyltransferase] [OR:Bacillus subtilis] [EC:2.7.7.9] [DB:pirl] >gp : [GI :g289287] 
[LN : BACGTABX] [AC:L12272] [PN : UDP-glucose pyrophosphorylase] [GNrgtaB] [ORrBacillus 
subtilis] [SR:Bacillus subtilis (strain 168, sub_species Marburg) DNA] 
[DB:genpept-bctl] [EC:2.7.7.9] [DE:Bacillus subtilis UDP-glucose pyrophosphorylase 
(gtaB) gene, complete cds . ] [NT: similar to UDP-glucose pyrophosphorylase of] [LE:120] 
[RE:998] [DI:direct] >gp : [GI:g4 05623] [LN: BSLYTGTA] [AC:Z22516] [PN: UDP-glucose 
pyrophosphorylase] [GN:gtaB] [OR:Bacillus subtilis] [DB : genpept - be t 1 ] [EC : 2 . 7.7.9] 
[DE:B. subtil is lytR, orfX, and gtaB genes.] [NT: similar to other procaryotic 
UDP-glucose] [SP:Q05852] [LE:1581] [RE:2459] [DI:direct] >gp : [GI : ell844 73 : g2636093 ] 
[LN:BSUB0019] [AC:Z99122 :AL009126] [PN:UTP-glucose- 1-phosphate uridylyltransferase] 
[GN:gtaB] [FN : glucosylation of teichoic acid] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[EC:2.7.7.9] [DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 
3809700.] [SP:Q05852] [LE:67599] [RE:68477] [DI:direct] >gp : [GI : ell84473 :g2636093] 
[LN:BSUB0019] [AC:Z99122 :AL009126] [PN:UTP-glucose- 1 -phosphate uridylyltransferase] 
[GN:gtaB] [FN: glucosylation of teichoic acid] [OR:Bacillus subtilis] [DB:genpept] 
[EC:2.7.7.9] [DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 
3809700.] [SP:Q05852] [LE:67599] [RE:68477] [DI:direct] 

NT AA 

ORF Name NT ID AAID t — _ t — , , score probabil ity 
LengthLength L - 



Aiyb0300098S_36134678_c3_2220 | [1187 | | 49S9 [ 232 |p^"| [2.8e- ^ 



Description 

pir: [LN:F69670] [AC:F69670 ] [PN:glycine betaine/carnitine/choline ABC transporter 
(membrane p) opuCD] [GN:opuCD ] [CL: glycine betaine/carnitine/choline ABC transporter] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g2271392] [LN: AF009352] [AC : AF009352] 
[PN: transmembrane protein] [GN:opuCD] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE: Bacillus subtilis osmoprotectant transport system OpuC includingATPase (opuCA) , 
transmembrane protein (opuCB) , osmoprotectantbinding protein precursor (opuCC) and 
transmembrane protein (opuCD) genes , complete cds.] [NT:OpuCD; part of the 
osmoprotectant transport system] [LE:3627] [RE: 4316] [DI: direct] 
>gp: [GI:ell86068:g2635893] [LN : BSUB0018] [AC : Z99121 : AL009126] [PN:glycine 
betaine/carnitine/choline ABC] [GN:opuCD] [FN: high affinity transport of glycine 
betaine,] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 18 of 21): from 3399551to 3609060.] [NT : alternate gene name: yvbB] 
[LE : 67059] [RE : 67748] [DI : complement] 

NT AA 

ORF Name NT ID AAID — — , , score probability 
LengthLength £ - 

|A*7503000985_3gi425i0_ci_i334 

Description 

NO-HIT 



[4960 |P^~| 
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NT AA 

ORF Name NTID AAID Le ~ thLe ~ th sc Q r ^ probability 



|Am03000985JS61Si(b92_c2_1840 | [ 1189 | [ 4951 || 651 ~j | pi || 5 . 

Description 

sp: [LN:YFKO_BACSU] [AC:034475] [GN: YFKO] [OR : BACILLUS SUBTILIS] [EC:1. -.-.-] 
[DE : PUTATIVE NAD ( P ) H NITROREDUCTASE YFKO,] [SP:034475] [DB : swissprot ] >pir : [LN : B69809] 
[AC:B69809 ] [PN : NAD ( P) H- flavin oxidoreductase homolog yfkO] [GN:yfkO ] 
[CL: nitroreductase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82773 :g2633107] 
[LN:BSUB0005] [AC : Z 9 9 1 0 8 : ALO 0 9126] [GN:yfkO] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 5 of 21): from 802821 
tol011250.] [NT:similar to NAD (P) H- flavin oxidoreductase] [SP:034475] [LE:51099] 
[RE:51764] [DI:direct] >gp : [GI : dl024284 :g2626827] [LN:D83967] [AC:D83967] [PN:YfkO] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain :AC327) DNA] [DB : genpept-bctl] 
[DE:Bacillus subtilis genomic DNA, 74 degree region.] [LE:16157] [RE:16822] 
[DI : complement] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|&:£7g0300098S_3620S28S_c2_i7i7 | [1190 | [T^~|[T^u~| | 259 || 284 | |6.0e-2b | 

Description 

Sp : [LN : XYNC_CALSA] [AC:P23553] [GN : XYNC] [OR : CALDOCELLUM S ACCHAROL YT I CUM ] 
[SR: , CALDICELLULOSIRUPTOR S ACCHAROL YT I CUS] [EC:3.1.-.-] [DE : ACETYL ESTERASE, 
(ACETYLXYLOSIDASE) ] [SP:P23553] [DB : swissprot] >pir : [LN : B3 7202] [AC:B37202 ] 
[PN:acetylesterase, (XynC) ] [ORrCaldocellum saccharolyticum] [EC: 3. 1.1. 6] [DB:pir2] 
>gp: [GI:gl44297] [LN: CDCXYNAB] [AC:M34459] [OR : Caldicellulosiruptor saccharolyticus] 
[SR:C. saccharolyticum DNA, clone pNZ1400] [DB : genpept-bctl] [DE : C . saccharolyticum 
xylanase A (XynA) , beta-xylosidase (XynB) andacetyl esterase (XynC) genes, complete 
cds.] [NT:acetyl esterase (XynC)] [LE:1257] [RE:2057] [DI:direct] >gp : [GI : g2645420] 
[LN:AF005383] [AC : AF005383] [PN : acetylxylosidase] [GN:XynC] [OR : Caldicellulosiruptor 
saccharolyticus] [DB : genpept - be t2 ] [DE : Caldicellulosiruptor saccharolyticus putative 
transport protein (XynG) , putative transport protein (XynH) , xylanase (XynF) , xylanase 
(XynE) , xylanase (XynD) , xylanase (XynA), acetylxylosidase (XynC) and xylanase (XynB) 
genes, complete cds.] [LE:13673] [RE:14473] [DI:direct] 



NT AA 

ORF Name NTID AAID — , — , . score probabilit y 

LengthLength c L 



|AI7503000985_362140b2_c3_1981 | [1191 | [ 4953 | 332 | pgg — [| 1 . 3e-10 

Description 

sp: [LN:Y31K_SULAC] [AC:P46218] [OR : SULFOLOBUS ACIDOCALDARIUS] [DE : HYPOTHETICAL 31.5 KD 
PROTEIN] [SP:P46218] [DB : swissprot ] >gp : [GI :g458265] [LN: SAU05664] [AC:U05664] 
[OR:Sulfolobus acidocaldarius] [DB : genpept-bctl] [DE : Sulf olobus acidocaldarius RNA 
polymerase subunit homolog gene, complete cds.] [NT : homologous to Swiss-Prot Accession 
Number P20435:] [LE:61] [RE:888] [DI:direct] 

NT AA 

ORF Name NTID AAID r — _ — , , score probability 

LengthLength c 



|A17i>0300098b_36225250_c2_i711 | [1192 | [4954 | p3?] |513 | . le-272 — | 

Description 

sp: [LN:SEPA_STAEP] [AC:P43148] [GN : SEPA] [OR : STAPHYLOCOCCUS EPIDERMIDIS] [EC : 3 . 4 . 24 . - ] 
[DE: EXTRACELLULAR ELASTASE PRECURSOR, (SEPP1) ] [SP:P43148] [DB : swissprot] 
>pir: [LN:A40659] [AC:A40659 ] [PN : elastase , SepPl] [OR : Staphylococcus epidermidis] 
[EC:3.4.24.-] [DB:pir2] >gp : [GI :g396259] [LN : SESEPP1A] [AC:X69957] [PN : protease] 
[GN:SepPl] [OR staphylococcus epidermidis] [DB : genpept-bctl] [DE : S . epidermis gene for 
protease.] [SP:P43148] [LE:164] [RE:1687] [DI:direct] 
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NT AA 

ORF Name NT ID AAID Le ~ thLe ^ t fr scor e probability 



A17503000985J362:>82b2_c3J.990 | [1193 | [4965 |pT7 | p^E || 67fl ||i.le-*>6 | 



Description 

spr [LNrHIS6_BACSU] [AC:034727] [GN : HISF] [OR : BACILLUS SUBTILIS] [DE :HISF PROTEIN 
(CYCLASE) ] [SP:034727] [DB : swissprot] >pir: [LN:B69641] [AC:B69641 ] [PN:HisF 
cyclase-like protein hisF] [GN : hisF ] [CLrcyclase hisF] [ORrBacillus subtilis] 
[DB:pir2] >gp : [GI r ell86175 r g2636000] [LN : BSUB0018] [AC : Z99121 : AL009126] [PNrHisF 
cyclase-like protein] [GNrhisF] [FN : synthesis of D-erythro- imidazole glycerol] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
18 of 21): from 3399551to 3609060.] [SP:034727] [LE:183072] [RE:183830] [DI : complement] 
>gp: [GI:g2618870] [LN : AF017113] [AC : AF017113] [PNrcyclase] [GNrhisF] [ORrBacillus 
subtilis] [DB rgenpept-bct2] [DErBacillus subtilis 300-304 degree genomic sequence.] 
[NTrHisF protein] [LE:44003] [RE:44761] [DIrdirect] 

NT AA 

ORF Name NTID AAID — _ ^ — , , score probability 
Length Length c JL 



I&I7B0300&586 c2 1545 



1194 



[4955 ||T71— \^UT 



3.4e-38 



Description 

pirr [LN:B69906] [AC:B69906 ] [PNrrarD protein homolog yojE] [GNryojE ] [CL : Escherichia 
coli rarD protein] [ORrBacillus subtilis] [DB:pir2] >gp r [GI r ell85420 r g2634341] 
[LNrBSUBOOll] [AC : Z99114 r AL009126] [GNryojE] [FNrunknown] [ORrBacillus subtilis] 
[DBrgenpept-bctl] [DErBacillus subtilis complete genome (section 11 of 21) : from 
2000171to 2207900.] [NT r similar to hypothetical proteins] [LEr 122204] [RE r 122995] 
[DI r complement] >gp r [GI rg3169320] [LN r AF026147] [AC r AF026147] [PNrYojD] [GNryojD] 
[FNrunknown] [ORrBacillus subtilis] [DB r genpept -bet 2 ] [DErBacillus subtilis YojA 
(yojA), YojB (yojB) , YojC (yojC) , YojD(yojD), YojE (yojE), YojF (yojF) , YojG (yojG) , 
YojH (yojH) , Yojl (yojl) , YojJ (yoj J) , YojK (yojK) , YojL (yojL) , YojM (yojM) , 
YojN(yojN), and YojO (yojO) genes, complete cds; and OdhA (odhA) gene, partial cds . ] 
[NT r similar to Escherichia coli RarD protein r] [LEr 2270] [RE r 3061] [DIrdirect] 

NT AA 

ORF Name NTID AAID — — ^ score probability 
LengthLength ^ JL 



AI7503000985_35617832_clJ.459 | [1195 | [ 4967 | [gPT — | |[7F5~ 



3.5e-75 



Description 

Sp r [LN r SUMT_BACME ] [ACrP29928] [GNrCOBA] [OR r BACILLUS MEGATERIUM] [EC r 2 . 1 . 1 . 107] 
[DErMETHYLASE) (SUMT) (UROPORPHYRINOGEN III ME THYLASE) (UROM) ] [SPrP29928] 
[DBr swissprot] >pir r [LNr A42479] [ACrA42479 ] [PN r S-adenosyl -L-methionine 
uroporphyrinogen III methyltransf erase] [CL r S-adenosyl -L-methionine uroporphyrinogen 
methyltransferase] [ORrBacillus megaterium] [DBrpir2] >gp r [GI rgl42695] [LN r BACCOBA] 
[ACrM62881] [PN r S-adenosyl -L-methionine r uroporphyrinogen III] [GNrCOBA] [ORrBacillus 
megaterium] [SRr Bacillus megaterium DNA] [DBrgenpept-bctl] [DErBacillus megaterium 
S-adenosy-L-methionine r uroporphyrinogen Illmethyltransf erase (COBA) gene, complete 
cds.] [NTrATCC #1078] [LEr 119] [RE r 835] [DIrdirect] 

NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength c JL 



|&I7S0300098S_38876J:2_4ei 

Description 

NO-HIT 



|ii96 I [4968 |pn—| p~ 
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NT AA 

QRF Name NTID AAID Le ^ t frLength score probability 



|A17B0300098i>_J906718_£l_67 | [1197 | [4969 |354 | |74 7 j|b-2e-74 

Description 

pir: [LN:H69789] [AC:H69789 ] [PN:L-iditol 2 -dehydrogenase , homolog ydjL : sorbitol 
dehydrogenase homolog] [GN:ydjL ] [CLialcohol dehydrogenase : long- chain alcohol 
dehydrogenase homology] [OR:Bacillus subtilis] [EC : 1 . 1 . 1 . 14] [DB:pirl] 
>gp: [GI:dl023636:g2522016] [LN : AB007638] [AC : AB007638] [PN : dehydrogenase] [GNiydjL] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : Marburg 168) DNA] 
[DB:genpept-bctl] [DE: Bacillus subtilis genomic DNA containing gutA to cotA region, 
48degree.] [NT : putative] [LE:9403] [RE: 10443] [DI : complement] 

>gp: [GI:ell82603 :g2632937] [LN: BSUB0004] [AC : Z99107 : AL009126] [GN:ydjL] [FN:unknown] 
[OR:Bacillus subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 
4 of 21): from 600701 to813890.] [NT:similar to L-iditol 2 -dehydrogenase] [LE:76928] 
[RE: 77968] [DI complement] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



&I75030005a5_350937g_c2_lg51 | [1198 | [ 4970 || 39g ~|| 13i ||2gg— || i . Oe-g P 



Description 

pir: [LN:H64716] [AC:H64716 ] [PN:ABC transporter, ATP-binding protein] [CL : ATP-binding 
cassette homology] [OR: Helicobacter pylori] [DB:pir2] >gp : [GI :g2314761] [LN : AE000655] 
[AC:AE000655 :AE000511] [PN:ABC transporter, ATP-binding protein (abc) ] [GN:HP1576] 
[OR Helicobacter pylori 26695] [DB :genpept-bct2] [DE Helicobacter pylori 26695 section 
133 of 134 of the completegenome . ] [NT:similar to GB:L42023 SP:P44785 PID:1005459] 
[LE:1828] [RE:2811] [DI:direct] 

NT AA 

QRF Name NTID AAID — — , score probability 
LengthLength -L 



AI750300098g_39i0g7bJ:3_1222 | [1199 | | 497i |p^~| | 77 || 57 [ 1 0,0048 



Description 

gp: [GI:g3293452] [LN: AF069160] [AC : AF069160] [PN:NADH dehydrogenase 1] [FN : respiratory 
enzyme] [OR : Mitochondrion Aphidius ervi] [SR:Aphidius ervi] [DB : genpept- invl] 

[DE:Aphidius ervi NADH dehydrogenase 1 gene, mitochondrial geneencoding mitochondrial 
protein, partial cds . ] [LE:<1] [RE:>465] [DI:direct] >gp : [GI :g3293458] [LN: AF069163] 

[AC:AF069163] [ PN : NADH dehydrogenase 1] [FN : respiratory enzyme] [OR: Mitochondrion 
Aphidius picipes] [SR: Aphidius picipes] [DB : genpept -invl] [DE: Aphidius picipes NADH 
dehydrogenase 1 gene, mitochondrial geneencoding mitochondrial protein, partial cds.] 

[LE:<1] [RE:>465] [DI:direct] >gp : [GI : g3293460] [LN : AF069164] [AC : AF069164] [ PN : NADH 
dehydrogenase 1] [FN : respiratory enzyme] [OR: Mitochondrion Aphidius pisivorus] 

[SR: Aphidius pisivorus] [DB : genpept -invl] [DE: Aphidius pisivorus NADH dehydrogenase 1 
gene, mitochondrial geneencoding mitochondrial protein, partial cds.] [LE:<1] [RE:>465] 

[DI: direct] >gp : [GI :g3293464] [LN: AF069166] [AC : AF069166] [PN: NADH dehydrogenase 1] 

[FN: respiratory enzyme] [OR : Mitochondrion Aphidius sonchi] [SR: Aphidius sonchi] 

[DB : genpept- invl] [DErAphidius sonchi NADH dehydrogenase 1 gene, mitochondrial 
geneencoding mitochondrial protein, partial cds.] [LE:<1] [RE:>465] [DI: direct] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



|AI750300098b 3923842 c2 1832 



[4972 |pi~||296 ||876 ||1 ■ le-87 



Description 

gp: [GI :g4959402] [LN : AF1153 91] [AC : AF115391] [PN:putative ribose transporter RbsU] 
[GN:rbsU] [OR : Lactobacillus sakei] [DB : genpept -bct2] [DE : Lactobacillus sakei LaaA 
(laaA) gene, partial cds; LaaB (laaB) , putative acetate kinase AckA (ackA) , LaaC (laaC) 

genes, completecds; rbs operon, complete sequence; and LaaE (laaE) gene, partialcds.] 
[LE:3130] [RE:4014] [DI: direct] 
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ORF Name 



!AlV50300098b 39281b2 T2 655 



Description 
NO-HIT 



ORF Name 



[AI7503000985 3928416 ti 170 



Description 
NO-HIT 



NTID 



NT AA 
AAID — < _ — , . score 
LengthLength 



probability 



] [1201 | [4973 |pn~||46 | 



NTID 



NT AA 
AAID — T — ^ score 
LengthLength 



probability 



1202 | [4974 |40 | 



ORF Name 



NTID 



IA17S03000985 3933177 12 683 



TIUT 



Description 
NO-HIT 



____ NT AA 

AAID — , — .score 

LengthLength 

|4975 |[TT5 [ |41 | 



probability 



ORF Name 



NT AA 
NTID AAID „ — , — , score 
Length Length 



IA17503000985 3941078 ti 114 



] [1204 | [4976 — | [107 | [Im- 



probability 
1.6e-06 



Description 

pir: [LN:C69845] [AC:C69845 ] [PN : hypothetical protein yjbS] [GN:yjbS ] [OR:Bacillus 
SUbtilis] [DB:pir2] >gp: [GI : ell83188 :g2633522] [LN: BSUB0007] [AC : Z99110 : AL009126] 
[GNryjbS] [FNiunknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [LE:49925] [RE:50125] 
[DI: direct] 



ORF Name 



NTID 



AAID 



IA1750300098& 3941436 11 6 



Description 
NO-HIT 



NT AA 
LengthLength 
] [1205 | [4977 [pg— 1|49 | 



score probability 



ORF Name 



NT 



AA 



NTID AAID 4 _ UT _ 

LengthLength 



score 



[A17503000985 3953400 11 314 



] [1205 | [4978 ||195 "| [64 [Im- 



probability 
10.017 



Description 

sp: [LN:RFA1_KLEPN] [AC:Q48475] [GN : RFBA] [OR : KLEBSIELLA PNEUMONIAE] [DE : O- ANTIGEN 
EXPORT SYSTEM PERMEASE PROTEIN RFBA] [SP:Q48475] [DB : swissprot] >pir: [LN:S60882] 
[AC:S60882 ] [PN: integral membrane O-antigen translocator protein rfbA] [GN:rfbA ] 
[CL: integral membrane O-antigen translocator protein rfbA] [OR : Klebsiella pneumoniae] 
[DB:pir2] >gp: [GI :g567182] [LN : KPNRFBA] [AC:L31775] [PN:integral membrane O-antigen 
translocator] [GN:wzm] [OR : Klebsiella pneumoniae] [DB :genpept-bct2] [DE : Klebsiella 
pneumoniae integral membrane O-antigen translocatorprotein (wzm) and ATP-binding 
protein (wzt) genes, complete cds;and WbbM (wbbM) gene, partial cds . ] [NTrsimilar to 
kpsM (E. coli) , bexA (H. influenzae),] [LE:324] [RE:1103] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — — , - scor e 
LengthLength 



probability 



AI7503000985_4004643_t3_1285 

Description 

NO-HIT 



] [1207 [ [4979 || 201 "[ p | 
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ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



F 



1750300098b 40fe%43 c2 2243 



1208 | [4980 ||480 1 1159 | |IgO~||fl . 2e-12 



Description 

gp: [GI:el310302:g3294247] [LN:SC7C7] [AC : AL03103 1] [PN : hypothetical protein SC7C7.14] 
[GN:SC7C7.14] [OR : Streptomyces coelicolor] [DB : genpept-bctl] [DE : Streptomyces 
coelicolor cosmid 7C7.] [NT : SC7C7 . 14 , unknown, len: 161 aa] [LE: 24125] [RE: 24610] 
[DI redirect] 



ORF Name 



^17503000985 4079552 ±2 695 



NTID 



NT 



AA 



AAID _ , _ — 



LengthLength 



score 



T7UT 



[4981 ||630 ||209 ||87 | [ 



probability 
0.0012 



Description 

gp: [GI:g252560] [LN: S44068S2] [AC:S44069] [GN:VP4/VP2] [OR : Hepatitis A virus] 
[SR:Hepatitis A virus LSH/S] [DB : genpept-vrl] [DE : VP1 . . . 3C/3D [hepatitis A virus HAV, 

LSH/S, Genomic, 6 genes, 266nt, segment 2 of 7].] [NT : This sequence comes from Fig. 3. 
[LE:1] [RE: 266] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID _ , — 



LengthLength 



score 



probability 



IAI7503000985 4079626 tl 336 



[1210 | [4982 ||63 ||775 1|0.018 



Description 

sp: [LN:UTMP_BOVIN] [AC:P46201] [OR:BOS TAURUS] [SR:, BOVINE] [DE : UTERINE MILK PROTEIN 
PRECURSOR (UTMP) ] [SP:P46201] [DB : swissprot] >gp : [GI : g4384 81] [LN:BOVSPIS] [AC:L22095] 
[PN: serine proteinase inhibitor precursor] [OR: Bos taurus] [DB :genpept-mam] [DE:Bos 
taurus uterine milk protein precursor, mRNA, complete cds . ] [NTruterine milk protein] 
[LE:19] [RE:1398] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000985 4084635 12 483 



][ 



T2TT" 



[4983 |p^l~[ |46 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID , — , 

LengthLength 



score 



IAI7503000985 4084717 t3 961 



1212 | [4984 ||621 1 1206 | 



TTTT 



probability 
10.00027 



Description 

pir : [LN:B70045] [AC:B70045 ] [PN : hypothetical protein yvpB] [GN:yvpB ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI : ell86182 :g2636007] [LN : BSUB0018] [AC : Z99121 : AL009126] 
[GN:yvpB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 18 of 21): from 3399551to 3609060.] [LE:189121] [RE:189873] 
[DI:direct] >gp : [GI :g2618863] [LN : AF017113] [AC : AF017113 ] [PN:YvpB] [GN:yvpB] 
[OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis 300-304 degree genomic 
sequence.] [LE: 37960] [RE: 38712] [D I : complement] 



ORF Name 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



AI7503000985_4093932_li_69 

Description 

NO-HIT 



2 [1213 | [4985 |p5~ ||44 | 
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NT AA 

ORF Name NTID AAID Le ~ thLe ^ t fr scor ^ probability 

|Ai7b03000985_4094703_cl_144g | [1214 | [4985 | puT?"|p3 [ gig || 3.7e-21 — 

Description 

pin [LN:B70001] [AC:B70001 ] [PN:ABC transporter (permease) homolog ytsD] [GNrytsD ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85910 :g2635521] [LN : BSUB0016] 
[AC:Z99119:AL009126] [GNrytsD] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DEiBacillus subtilis complete genome (section 16 of 21): from 2997771to 3213410.] 
[NT: similar to ABC transporter (permease)] [LE: 110708] [RE: 112648] [DI : complement] 
>gp: [GI:g2293178] [LN : AF008220] [AC : AF00822 0] [PN : YtsD] [GN : ytsD] [ORrBacillus 
subtilis] [DB:genpept-bct2] [DE:Bacillus subtilis rrnB-dnaB genomic region.] 
[NT: similarity to NADH dehydrogenase] [LE : 67779] [RE: 69719] [DI: direct] 

NT AA 

ORF Name NTID AAID — _ „ — _ score probability 
LengthLength c 

Al760300098i>_40966_c3_2016 | [1215 | [4987 |pT2Tj 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — , _ sco re proba bility 
LengthLength ^ *~ 

AlVb0300098B_4098S18J:i_48 | [1216 | [ 4988 ||7T1~] | 236 | pT~~| [b . 8e-08 



Description 

sp : [LN : ARP_PLAFA] [AC:P04931] [OR : PLASMODIUM FALCIPARUM] [DE : ASPARAGINE-RICH PROTEIN 
(AG319) (ARP) (FRAGMENT)] [SP:P04931] [DB : swissprot] >pir: [LN:A23770] [AC:A23770 ] 
[PN:asparagine- rich protein] [OR : Plasmodium falciparum] [DB:pir2] >gp : [GI :gl60092] 
[LN : PFAARP] [AC :M24328 : X03716] [PN : asparagine-rich protein] [GN:Ag319] [OR : Plasmodium 
falciparum] [SR : Plasmodium falciparum (Ghanaian isolate NF7) asexual bloodstage] 
[DB:genpept-invl] [DE : Plasmodium falciparum asparagine-rich protein (ARP), partial 
cds.] [LE:1] [RE: 1612] [DIrdirect] 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength 



Al7b0300098S_41004S3_c3_2260 | [1217 | [ 4989 [[TuTT] | 347 | | 329 || 1.0e-29 



Description 

Sp : [LN : DEGS_BACBR] [AC:P54663] [GN : DEGS] [OR:BACILLUS BREVIS] [EC:2.7.3.-] [DE : SENSOR 
PROTEIN DEGS J [SP:P54663] [DB : swissprot] >pir : [LN : 139834] [AC:I39834 ] [PN:protein 
kinase] [GN:degS ] [CL : regulatory protein degS] [OR: Bacillus brevis] [DB:pir2] 
>gp: [GI:g710495] [LN : BACDEGSU] [AC:L15444] [PN:protein kinase] [GN:degS] 
[OR:Brevibacillus brevis] [SR:Bacillus brevis (strain Alk36) DNA] [DB :genpept-bct2] 
[DE: Bacillus brevis protein kinase (degS) gene, complete cds ; transcriptional activator 
protein (degU) gene, complete cds.] [NT : puta t i ve ] [LE:398] [RE: 1558] [DI:direct] 

NT AA 

ORF Name NTID AAID — — ^ score probability 
LengthLength c -L 



A17B0300098B_4i00938_c2_i8gQ | |12i8 | [ 4990 || 534 ^ |i77 | | 293 || i,ie-2B 



Description 

sp: [LN:GUDT__BACSU] [AC:P42237] [GN: YCBE] [OR:BACILLUS SUBTILIS] [DE : PROBABLE GLUCARATE 
TRANSPORTER] [SP:P42237] [DB : swissprot] >pir : [LN :H69752] [AC:H69752 ] [PN : probalble 
glucarate transporter] [GN:ycbE ] [CL:hexuronate transporter] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : dl 007040 :g709999] [LN: BACYCB20] [AC:D30808] [PN : glucarate 
dehydratase] [GN: ycbE] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain : 168TrpC2 ) 
DNA] [DB:genpept-bctl] [DE: Bacillus subtilis DNA around 20 degrees region of 
chromosomecontaining yckA-T genes.] [LE:3924] [RE: 5291] [DI:direct] 

>gp: [GI:ell82200:g2632534] [LN: BSUB0002] [AC : Z99105 : AL009126] [GN:ycbE] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis complete genome (section 
2 of 21): from 194651 to415810.] [NT:similar to glucarate transporter] [SP:P42237] 
[LE: 75738] [RE: 77105] [DI:direct] 



323 



ORF Name 



lAivboaoooaab 4110712 ci 1355 



Description 
NO-HIT 



NT AA 
NTID AAID — «^ — . , score 
Length Length 



probability 



] [1219 I [4991 ||S91~||196 | 



ORF Name 



^7503000985 4112527 t2 563 



Description 
NO-HIT 



NTID 



T72TT 



NT AA 
AAID — _ — , , score 
LengthLength 



probability 



|4992 



ITT? I 



TT 



ORF Name 



NTID 



„ T „ NT AA 

AAID — , — , . score 

LengthLength 



probability 



IAI7503000985 4112788 cl 1497 



Description 
NO-HIT 

ORF Name 



1221 | [4993 [[151— 1|46 | 



NTID 



NT 



AA 



AAID . , 

LengthLength 



score probability 



AI7503000985 4148428 tl 160 



1222 | [4994 |piiT] |382 ||528 ||8.3e-5i 



Description 

sp: [LN:ADH_ALCEU] [AC:P14940] [GN:ADH] [OR : ALCALIGENES EUTROPHUS] [EC : 1 . 1.1.1] 
[DE : ALCOHOL DEHYDROGENASE, ] [SP:P14940] [DB : swissprot] >pir : [LN : A30196] [AC:A30196 ] 
[PN:probable alcohol dehydrogenase,] [CLralcohol dehydrogenase : long -chain alcohol 
dehydrogenase homology] [OR : Alcaligenes eutrophus] [EC: 1.1. 1.1] [DB:pirl] 
>gp: [GI:gl41900] [LN : AFAADH] [AC:J03362] [OR : Ralstonia eutropha] [SR : A. eutrophus 
(strain H16) DNA, clone SR18] [DB :genpept-bctl] [DE : A. eutrophus alcohol dehydrogenase 
(ADH) gene, complete cds . ] [NTralcohol dehydrogenase (EC 1.1.1.1)] [LE:458] [RE:1558] 
[DI : direct] 



ORF Name 



NTID 



AAID 



1A17503000985 4164026 r2 793 



][ 



\T77T 



NT AA 
LengthLength 
[4995 |P^~| [5T 



score probability 
9.1e-09 



1EO 



Description 

pir: [LN:S68609] [AC:S68609 ] [PN : recombinase Sin] [CL : transposase repressor] 
[OR: Staphylococcus aureus] [DB:pir2] >gp : [GI :g495089] [LN : STASINA] [AC:L23109] 
[PN: recombinase] [GN:sin] [OR : Staphylococcus aureus] [SR : Staphylococcus aureus DNA; 
Transposon Tn4002 ( transposable elemen] [DB :genpept-bctl] [DE : Staphylococcus aureus 
recombinase (sin) gene, complete cds.] [NT : putative] [LE:389] [RE: 997] [DI: direct] 



ORF Name 



IA17503000985 4167842 t2 530 



NT AA 
NTID AAID — — ^ score 
Length Length 

[1224 | [ 4996 [ pT~] |66 | 



probability 



Description 
NO-HIT 



ORF Name 



NTID AAID — ^ — score 

LengthLength 



probability 



A17503000Sfl5_4179S35_c3_2003 

Description 

NO-HIT 



[1225 | [4997 |p5~| p~ 



324 



NT AA 

ORF Name NTID AAID Le ^ t frLength score probability 



AI7503000985_4303377_r2J)i0 | [1226 | [ 4998 [[ 2 7 0 | [89 || 8i | |0.00i "9r 



Description 

gp: [GI:g4049784] [LN : AF063866] [AC : AF063866] [PNrORF MSV234 hypthetical protein] 
[GN:MSV234] [OR : Melanoplus sanguinipes entomopoxvirus] [DB : genpept - vrl ] [DE : Melanoplus 
sanguinipes entomopoxvirus, complete genome.] [LE: 201477] [RE: 201830] [DI : complement] 

NT AA 

ORF Name NTID AAID — — _ score proba bility 
Length Length *- 



IAI7503000985 4329453 cl 1533 



1227 | [4999 |p?5~| [ill [|220 ||3.6e-18 



Description 

pir: [LN:G69781] [AC:G69781 ] [PN : thioredoxin homolog ydfQ] [GN:ydfQ ] [OR:Bacillus 
SUbtilis] [DB:pir2] >gp : [GI :dl020138 :gl881358] [LN : AB001488] [AC : AB001488] [GNiydfQ] 
[ORrBacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] [DB : genpept -be tl] 
[DE: Bacillus subtilis genome sequence, 148 kb sequence of the regionbetween 35 and 47 
degree.] [NT : SIMILAR TO THIOREDOXIN.] [LE:131947] [RE:132285] [DIrdirect] 
>gp: [GI:ell82517:g2632851] [LN: BSUB0003] [AC : Z99106 : AL009126] [GN:ydfQ] [FN:unknovm] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete genome (section 
3 of 21): from 402751 to611850.] [NT:similar to thioredoxin] [LE:195693] [RE:196031] 
[DI: direct] 

NT AA 

ORF Name NTID AAID — _ T — , score probability 
LengthLength L - 



LAI750300098b 4334818 12 854 1 1 1228 I 1 5000 1 11555 | [571 IfTTT 



] [1228 | [5000 ||T555~|pi ||lli ||0.018 



Description 

pir: [LN:D71618] [AC:D71618 ] [PN : hypothetical protein PFB0285c] [GN:PFB0285c ] 
[OR: Plasmodium falciparum] [DB:pir2] >gp : [GI : g3845144] [LN : AE001385] 
[AC:AE001385:AE001362] [PN : hypothetical protein] [GN : PFB0285c] [OR : Plasmodium 
falciparum] [SR: malaria parasite P. falciparum] [DB : genpept -inv2] [DE : Plasmodium 
falciparum chromosome 2, section 22 of 73 of thecomplete sequence.] [NT: predicted by 
GlimmerM] [LE:303] [RE: 4613] [DI : complement] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000985_4336088_c3_2078 | [1229 | [ 5001 | p^| | 20 7 ||283 || 7.6e-25 



Description 

pir: [LN:F69877] [AC:F69877 ] [PN : uroporphyrin- III C-methyltransf erase homolog ylnF] 
[GN:ylnF ] [CL:Aquifex aeolicus siroheme synthase] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI :e332189:g2462962] [LN : BSPYREYLO] [AC : AJ000974] [PN: YlnF protein] [GN: ylnF] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis pyrE to yloA gene 
region.] [LE:5769] [RE:6257] [DI:direct] >gp : [GI : ell85155 :g2633936] [LN: BSUB0009] 
[AC:Z99112 :AL009126] [GN:ylnF] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:similar to uroporphyrin- III C-methyltransf erase] [LE:36582] [RE:37070] [DI:direct] 

NT AA 

ORF Name NTID AAID — _^ — , , score probab ility 
LengthLength L 



AI7503000985_43797g3_i:3_1048 | [1230 | [ 5002 | p71~[| 40 | 



Description 
NO -HIT 



325 



ORF Name 



NT ID AAID 



NT 



LengthLength 



AA 

— L , score 



|^VB030009flb_4J927b7_cl_1619 | [1231 | [5003 ||43b ~| |144 | |196 | [ 



probability 
18 . Oe-15 



Description 

gp: [GI:el314293:g3395543] [LN:SC4A2] [AC : AL031182] [PN:putative sugar transferase] 
[GN:SC4A2 . 10c] [OR : Streptomyces coelicolor] [DB : genpept-bctl] [DE : Strep tomyces 

coelicolor cosmid 4A2 . ] [NT : SC4A2 . 10c , probable sugar transferase, len: 478/] [LE:9579] 
[RE: 11015] [DI : complement] 



ORF Name 

IAI7503000985 4459375 ±2 830 



NT 



AA 



^ ™R Le^thLe^th ^g probability 



Description 
NO-HIT 



1232 | poo4 |prrr~] p~ 



ORF Name 



NTID 



AA 



NT 

AAID — — score 
LengthLength 



probability 



17503000985 4487550 £3 980 



[1233 | [5005 ||T7Zrr|| B6B | [1408 | | 4 . Ve-144 



Description 

pir: [LN:D69748] [AC : D69748 : 14 0419 ] [PN:amino acid transporter homolog ybeC] [GN:ybeC ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl034075 :g3599634] [LN: ABO 064 24] [AC : AB006424 ] 
[GNrybeC] [OR:Bacillus subtilis] [SRrBacillus subtilis (strain: 168) DNA] 
[DB: genpept-bctl] [DE:Bacillus subtilis genomic DNA, 70 kb region between 17 and 
23degree.] [LE:34338] [RE:35957] [DI:direct] >gp: [GI :ell82164:g2 632498] [LN: BSUB0002] 
[AC:Z99105 :AL009126] [GN:ybeC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 2 of 21): from 194651 to415810.] 
[NT: similar to amino acid transporter] [LE : 36687] [RE: 38306] [DI: direct] 



ORF Name 



NTID 



„ Tn NT AA 

AAID — — _ score 

LengthLength 



[A17503000 985_4491713_t2__533 



probability 
S.le-39 



Description 

pir : [LN:F64554] [AC:F64554 ] [PN : guanosine pentaphosphate phosphohydrolase] 
[OR Helicobacter pylori] [DB:pir2] >gp: [GI : g2313368] [LN : AE000546] 
[AC:AE000546:AE000511] [PN : guanosine pentaphosphate phosphohydrolase] [GN:HP0278] 
[OR Helicobacter pylori 26695] [DB :genpept-bct2] [DE Helicobacter pylori 26695 section 

24 of 134 of the complete genome.] [NT: similar to GB:M87049 SP:P25552 GB:M83316 

PID:148183] [LE:6964] [RE:8418] [DI:direct] 



ORF Name 



NTID AAID 



IAI7503000985 455316S £3 1122 



TZTT 



Description 
NO-HIT 



NT AA 
LengthLength 
[5007 j|T73 [ p 



score probability 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



[M7503000985 4574012 ti 208 



1235 | [5008 |pU^T||359 [pJTJ 



score probability 
|2.0e-58 



Description 

pir : [LN: JN0500] [AC : JN0500 : S3 4 96 7 ] [PN : dihydroorotate oxidase , , 

mitochondrial : dihydroorotate dehydrogenase] [GN:dhod ] [CL : dihydroorotate oxidase] 
[OR:Drosophila melanogaster] [EC: 1.3. 3.1] [DB:pir2] 



326 



NT AA 

ORF Name NTID AAID Le ~ thLe ^ t h SCQ: re probability 

|Al7b0300098b_4662J:2J>lj9 | [ 1237 | [ 5009 |pT8T~ | [ 105 |[ K4 || 0 . 0046 | 

Description 

gp: [GI:g4049856] [LN: AF063866] [AC : AF063 866] [PN:ORF MSV064 hypothetical protein] 
[GN:MSV064] [OR : Melanoplus sanguinipes entomopoxvirus] [DB :genpept-vrl] [DE : Melanoplus 
sanguinipes entomopoxvirus, complete genome.] [LE: 64316] [RE: 65158] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ — _ score probability 
LengthLength 

AI75030009a5_46a7705_c2_1841 | [1238 | [5010 |pn~| |46 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength - 

A17S030009flS_46fl77S2_cl_lS14 1 [1239 | | 501i |[IT55-| |394 [ [771 | |9 . le-77 



Description 

pir: [LN:F69863] [AC:F69863 ] [PN : aspartate aminotransferase homolog ykrV] [GN:ykrV ] 
[CL: aspartate transaminase] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell84948 :g2633729] 
[LN:BSUB0008] [AC : Z99111 : AL009126] [GN:ykrV] [FNrunknown] [ORrBacillus subtilis] 
[DB :genpept-bctl] [DErBacillus subtilis complete genome (section 8 of 21): from 
1394791to 1603020.] [NT: similar to aspartate aminotransferase] [LE: 30317] [RE: 31513] 
[DI: direct] 



ORF Name 



NT AA 

NTID AAID — — _ score probability 
LengthLength -L 



IAI75030009BS 4689390 c3 2124 



] [1240 | [5012 [[TT7— 1|48 | 



Description 
NO-HIT 



ORF Name 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



^17503000985 4594153 c2 1791 



2 [1241 | [5013 ||144 | [47 | 



Description 
NO-HIT 



ORF Name 



NT AA 

NTID AAID Le ~ thLe ~ t fr score probability 



U750300098S 4694552 c3 2199 



Description 
NO-HIT 



^ [1242 | [5014 |p5~ j |44 | 



ORF Name 



NT AA 

NTID AAID Le ^ thLe ~ t h score probability 



|M750300098S_469S45i_i2_864 



] [1243 | [5015 |P^~| pT 



Description 
NO-HIT 



ORF Name 



NT AA 

NTID AAID — . — , , score probabi lity 
LengthLength L - 



IAI7503000985 4698428 c2 1705 



] [1244 | [5015 |pu~]|59 | 



Description 
NO-HIT 
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ORF Name 



NT AA 
NTID AAID — ~ — score 
Length Length 



probability 



IAI7503000985 470b077 t2 880 



J [1245 | [5017 ||12i~||40 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — « — , , score 
LengthLength 



17503000985 4715377 12 S18 



T2TT 



5018 11447 I fTTO 



probability 
3.4e-05 



Description 

gp: [GI:el407826:g4493896] [LN : PFMAL3P2] [AC : AL034558 ] [GN : MAL3P2 . 18] [OR : Plasmodium 
falciparum] [SR:malaria parasite P. falciparum] [DB : genpept - invl ] [DE : Plasmodium 
falciparum MAL3P2, complete sequence.] [NT : predicted using hexExon; MAL3P2.18 
(PFC0245c),] [LE:84094] [RE:95895] [DI : complement] 



ORF Name 



NTID 



NT AA 
AAID — — , , score 
LengthLength 



probability 



E 



17503000985 4719775 c3 2182 



] [1247 | [5019 ||7T^~||247 [|434 ||7.6e-41 



Description 

sp: [LN:YWAC_BACSU] [AC:P39583] [GN : YWAC : IPA-7D] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
24.6 KD PROTEIN IN DAE-TYRZ INTERGENIC REGION] [SP:P3 9583] [DB : swissprot] 
>pir: [LN:S39662] [AC : S3 9662 : C70050 ] [PN : GTP-pyrophosphokinase homolog ywaC:protein 
ipa-7d] [GN:ywaC ] [CL:GTP pyrophosphokinase related protein] [ORrBacillus subtilis] 
[DB:pir2] >gp : [GI : g413 931] [LN : BSGENR] [AC:X73124] [GN:ipa-7d] [ORrBacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] [SP:P39583] [LE:6981] 
[RE:7613] [DI: direct] >gp : [GI :ell86347:g2 636383] [LN : BSUB002 0] [AC : Z99123 : AL009126] 
[GNrywaC] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis 
complete genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate gene name: 
ipa-7d; similar to] [SP:P39583] [LE:150346] [RE:150978] [DI : complement] 



ORF Name 



NTID 



NT AA 
AAID — ^ UT — score 
LengthLength 



probability 



IAI7503000985 4719827 c3 1986 



Description 
NO-HIT 



] [1248 | [5020 ||417 | [138 | 



ORF Name 



NTID 



NT AA 
AAID — _ T — , . score 
LengthLength 



IA17503000985 4720317 t2 475 



T2T9" 



probability 
1.2e-170 



Description 

gp: [GI:g4574233] [LN: AF106849] [AC : AF106849] [PN: FmhA] [GN:fmhA] [OR : Staphylococcus 
aureus] [DB : genpept -be t2] [DE : Staphylococcus aureus FmhA (fmhA) gene, complete cds.] 
[NT: similar to Staphylococcus aureus FemA and FemB] [LE:201] [RE: 1451] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IAI7503000985 4723510 c2 1759 



] [1250 | [5022 |p^T]|886 ||2374| 



2.0e-245 



Description 

sp: [LN:PODK_CLOSY] [AC:P22983] [GN : PPDK] [OR : CLOSTRIDIUM SYMBIOSUM] 
SYMBIOSUS] [EC:2.7.9.1] [DE : DIKINASE) ] [SP:P22983] [DB : swissprot] 



[SR: , BACTEROIDES 
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ORF Name 



NT AA 
NTID AAID — - T — .score 
LengthLength 



probability 



AI7503000985 4725068 c3 2219 



] [1251 | [5023 ||545 | [214 | p?— [j5 . 8e-59 



] 



Description 

pir: [LN:D69670] [AC:D69670 ] [PNrglycine betaine/carnitine/choline ABC transporter 
(membrane p) opuCB] [GN:opuCB ] [CL: glycine betaine/carnitine/choline ABC transporter] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : g2271390] [LN : AF009352] [AC : AF009352] 
[PN: transmembrane protein] [GNropuCB] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis osmoprotectant transport system OpuC includingATPase (opuCA) , 
transmembrane protein (opuCB) , osmoprotectantbinding protein precursor (opuCC) and 
transmembrane protein (opuCD) genes, complete cds J [NT: OpuCB; part of the 
osmoprotectant transport system] [LE:2025] [RE: 2678] [DI: direct] 
>gp: [GI:ell86070:g2635895] [LN : BSUB0018] [AC : Z99121 : AL009126] [PN:glycine 
betaine/carnitine/choline ABC] [GNropuCB] [FN: high affinity transport of glycine 
betaine,] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 18 of 21): from 3399551to 3609060.] [NT : alternate gene name: yvbD] 
[LE: 68697] [RE: 69350] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



[AI7503000985 4726636 ci 1545 



[5024 Ip^TB [ [45 [ 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



LAI7503000985 4727187 c2 1702 



1253 | [5025 ||288T]|962 |p7 



probability 
7.4e-58 



Description 

sp: [LN:PIP_LACLA] [AC:P49022] [GN:PIP] [OR : LACTOCOCCUS LACTIS] 

[SR: ,SUBSPLACTIS: STREPTOCOCCUS LACTIS] [DE : PHAGE INFECTION PROTEIN] [SP:P49022] 
[DB:swissprot] >gp : [GI :g308861] [LN : LACPIP] [AC:L14679] [GN:pip] [FN: required for phage 
infection] [OR : Lactococcus lactis] [SR : Lactococcus lactis (strain C2, sub_species 
lactis) DNA] [DB:genpept-bctl] [DE : Lactococcus lactis pip and gerC2 genes, complete 
cds's, and rrggene, 5' end of cds.] [NT : GTG start codon] [LE:391] [RE:3096] [DI:direct] 



ORF Name 



NTID AAID . — , . — 



LAI7503000985 4728187 cl 1355 



T23T" 



Description 
NO-HIT 



NT AA 
LengthLength 
[5026 [ [77 



score 



probability 



ORF Name 



E 



17503000985 4735937 t3 1114 



NTID 



_ _ ___ NT AA 

AAID — — , , score 

LengthLength 



probability 



Description 
NO-HIT 



[5027 ||138 [ [¥F 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



(AI7503000985 4737550 c3 2248 



] [1256 | [5028 ||759~||252 [|387 | [ 



probability 
7.3e-36 



Description 

gp: [GI:g4433637] [LN : AF029224] [AC : AF029224 : AF029225] [PN:NirR] [GN:nirR] [FN: involved 
in nitrite reduction] [OR: Staphylococcus carnosus] [DB :genpept-bct2] [DE : Staphylococcus 
carnosus nir and nar operons, complete sequences.] [LE:1184] [RE: 1906] [DI: direct] 
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ORF Name 

IA17503000985_477721 7 _c3_225 7 



NTID 



NT AA 
AAID — T — L score 
Length Length 



probability 



■j [1257 | [ 5029 || 1569 ||522 | pg2B"|| 7 . 9e-252 



Description 

gp: [GI:el429596 :g4756152] [LN:A67161] [AC:A67161] [ FN : NARH GENE] [OR : Staphylococcus 
carnosus] [DB : genpept-pat] [DErSequence 1 from Patent EP0805205.] [NT:unnamed protein 
product] [LE:7804] [RE:9381] [DIrdirect] >gp : [GI :g3 929523] [LN: AF029224] 
[AC:AF029224:AF029225] [PN:NarH] [GNmarH] [OR : Staphylococcus carnosus] 

[DB : genpept - be 1 2 ] [DE : Staphylococcus carnosus nir and nar operons, complete sequences.] 
[NT: similar to Escherichia coli nitrate reductases NRA] [LE:9989] [RE: 11566] 
[DI : direct] 



ORF Name 



IAI7503000985 4787807 ±3 984 



Description 
NO-HIT 



NTID 



7v7vTr ^ NT AA 

AAID — _ — , , score 

LengthLength 



probability 



|5030 |pTT3 1 



ORF Name 



E 



17503000985 4814682 13 1288 



Description 
NO-HIT 



NT AA 

NTID AAID — — , score probability 
LengthLength 



1259 



[5031 |pm 1 166 | 



ORF Name 



IAI7503000985 4870907 tl 195 



Description 
NO-HIT 



ORF Name 



IAI7503000985 4875316 ci 1543 



Description 
NO-HIT 



ORF Name 



| A17b030009flB_487S760Jbl_2BB 

Description 

NO-HIT 



NT AA 

NTID AAID — _^ — , , score probabi lity 
LengthLength c L - 



TT6TT 



|5032 [|231 | |76 | 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



[5033 | 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



1262 



[5034 [|135 | 



ORF Name 



XTmTt ^ NT AA 

NTID AAID — _ „. — , , score 

LengthLength 



probability 



IAI7503000985 4876387 c3 2095 



1263 I PTJTF 



Description 
NO-HIT 



[186 ||6i | 



ORF Name 



NT 



AA 



NTID AAID — , 



LengthLength 



score 



17S03000985 4876932 c2 1733 



|b036 |pT5~||82 I [74 | [ 



probability 
0 . 013 



Description 

gp: [GI:gl255424] [LN : CELC33G8] [AC:U53154] [GN:C33G8.3] [OR : Caenorhabditis elegans] 
[DB: genpept -invl] [DE : Caenorhabditis elegans cosmid C33G8.] [LE : 27678 : 27899 : 28087] 
[RE : 27848 : 27952 : 28290] [DI : direct Join] 
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ORF Name 



NTID 



NT 



AA 



AAID _ . , . 



Length Length 



score probability 



|Al7503000985_4878J12_c2_1648 

Description 

NO-HIT 



1265 | [5037 |pnr^| 46 | 



ORF Name 



NTID 



M „ NT AA 

AAID — — , , score 

LengthLength 



1A17503000985 4882893 c3 2012 



] [1266 | [5038 |p^TTT]|402 |JZ5T" 



probability 
II. Oe-27 



1 



Description 

gp: [GI:dl039027:g4512424] [LN: AB017508] [AC : AB017508] [GN: secY] [OR:Bacillus 
halodurans] [SRrBacillus halodurans (strain :C-125) DNA] [DB : genpept-bctl] [DErBacillus 
halodurans C-125 genomic DNA, 32 kb fragment, completecds . ] [NT : secY homologue 
(identity of 70% to B. subtilis )] [LE:22481] [RE:23773] [DIrdirect] 



ORF Name 



NTID 



AAID 



NT 



AA 



E 



17503000985 4891002 c2 1834 



LengthLength probability 
j [1267 | [5039 |pTT]| 371 | pn~ || 5 . 8e-66 | 



Description 

pir: [LN:A43577] [AC:A43577 ] [PN: regulatory protein pfoR] [OR: Clostridium perfringens] 
[DB:pir2] 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



IAI7503000985 4980378 cl 1635 



[5040 [[5T7~||178 ||578 ||4.2e-56 



Description 

gp: [GI:g4096799] [LN : SCU40158 ] [AC:U40158] [OR : Staphylococcus carnosus] 
[DB:genpept-bct2] [DE : Staphylococcus carnosus response regulator- like protein 
(orfx)gene, partial cds . ] [NT:orfx; function unknown; similar to response] [LE:<1] 
[RE: 560] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID — — , 
LengthLength 



[AI7503000985 5117137 c2 1827 



] [1269 | [5041 |p^— |p37 



score probability 
| |499 || 9.8e-48 



Description 

pir: [LN:G70069] [AC:G70069 ] [PN:capsular polyglutamate biosynthesis homolog ywtA] 
[GN:ywtA ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84495 :g2636114] [LN : BSUB0019] 
[AC:Z99122 :AL009126] [GN:ywtA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[NT: similar to capsular polyglutamate biosynthesis] [LE: 100949] [RE: 101398] 
[DI: complement] >gp: [GI : e308089 :gl894766] [LN : BSZ92954] [AC:Z92954] [GN : ywtA] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis yws [A, B, C, D, E, F, G] and gerBC 

genes.] [NT:product highly similar to Bacillus anthracis CapC] [LE:1084] [RE: 1533] 
[DI:direct] >gp: [GI : ell84495 : g2636114] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN : ywtA] 
[FN: unknown] [OR: Bacillus subtilis] [DB:genpept] [DE: Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT:similar to capsular polyglutamate 

biosynthesis] [LE : 100949] [RE : 101398] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



p750 



3000985 5128425 t2 821 



][ 



TT7TT 



[5042 |pF7F] 



probability 
|2.1e-45 



Description 

pir: [LN:F69848] [AC:F69848 ] [PN: transcription antiterminator BglG family homolog yjdC] 
[GN:yjdC ] [CL: phosphotransferase system mannitol -specific enzyme II factor III 

homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83220 : g2633554] [LN : BSUB0007] 
[AC:Z99110:AL009126] [GN:yjdC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 7 of 21): from 1194391to 1411140.] 
[NT: similar to transcriptional antiterminator (BglG] [LE: 75712] [RE: 77658] [DI: direct] 
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NT AA 

ORF Name NTID AAID Le ~ thXje ^ t h score probability 



|AI7B0300098i>J>13Ji>fe2_t2J>!>fe | [ 12 7 1 | [5043 || 222 1 | 73 | f[gp— . ie-l T 



Description 

pir: [LN:F70041] [AC:F70041 ] [PN:probable mercuric ion-binding protein yvgY iperiplasmic 
mercuric ion-binding protein merP homolog] [GN: yvgY ] [CL: mercuric resistance operon 
regulatory protein: heavy-metal -associated homology] [OR: Bacillus subtilis] [DB:pirl] 
>gp: [GI:ell86039:g2635864] [LN: BSUB0018] [AC : Z99121 : AL009126] [GNryvgY] [FN:unknown] 

[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
18 of 21): from 3399551to 3609060.] [NT:similar to mercuric transport protein] 

[LE:43125] [RE: 43334] [DI : complement] 



NT AA 

ORF Name NTID AAID — . — , _ score probabilit y 
L^ngthLength 



|&175030009fl5_5i36002_c2_lgSi | [1272 | [5044 | p^~| | 12i | | 136 | | 2.9e-09 

Description 

pir : [LN:C70043] [AC:C70043 ] [PN : hypothetical protein yvlA] [GN:yvlA ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell86201 :g2636026] [LN: BSUB0018] [AC : Z99121 : AL009126] 
[GN : yvlA] [FN:unknovm] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 18 of 21): from 3399551to 3609060.] [LE:207957] [RE:208283] 
[DI complement] >gp: [GI:ell84419:g2 636039] [LN:BSUB0019] [AC : Z99122 : AL009126] [GN: yvlA] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 19 of 21) : from 3597091to 3809700.] [LE:10417] [RE:10743] 
[DI: complement] >gp : [GI :g2618844] [LN: AF017113] [AC : AF017113] [PN:YvlA] [GN: yvlA] 
[OR:Bacillus subtilis] [DB : genpept-bct2] [DE:Bacillus subtilis 300-304 degree genomic 
sequence.] [LE:19550] [RE:19876] [DI:direct] >gp : [GI : ell84419 :g2636039] [LN : BSUB0019] 
[AC:Z99122 :AL009126] [GN:yvlA] [FN:unknown] [OR:Bacillus subtilis] [DB:genpept] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[LE : 10417] [RE : 10743] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ T — , , score probabili ty 
LengthLength c z - 



AI7503u00985__5203452_c3_1989 | [1273 | [ 5045 |192 [[ 398 | | 5.0e-37 



Description 

sp: [LN : HIS5_LACLA] [AC:Q02132] [GN:HISH] [OR : LACTOCOCCUS LACTIS] 

[SR: , SUBSPLACTIS : STREPTOCOCCUS LACTIS] [EC:2.4.2.-] [DE : AMIDOTRANSFERASE HISH,] 
[SP:Q02132] [DB : swissprot] >pir : [LN : 145734] [AC:I45734 ] [PN:HisH] [CL : amidotransf erase 
hisH] [OR:Lactococcus lactis subsp. lactis] [DB:pir2] >gp : [GI : g2565145] [LN : LLU92974] 
[AC:U92974:M90760:M90761] [PN: HisH] [GN:hish] [OR : Lactococcus lactis] [DB : genpept-bctl] 
[DE : Lactococcus lactis unknown gene, partial cds, and HisC ( hi sC ), unknown, HisG (hisG) , 
unknown, HisB (hisB) , unknown, HisH (hish) ,HisA (hisA) , HisF (hisF) , HisIE (hisIE) , 
unknown, unknown, LeuA(leuA) , LeuB (leuB) , LeuC (leuC) , LeuD (leuD) , unknown, 
IlvD(ilvD), IlvB (ilvB) , IlvN, IlvC (ilvC) , IlvA (ilvA) , AldB (aldB)and aldR (aldR) 
genes, complete cds.] [NT : aminotransferase] [LE:7295] [RE:7903] [DI:direct] 

NT AA 

ORF Name NTID AAID thLe ~ th score probability 



AlVb0300098B_S2SlS6B_t3_1043 | [1274 | |b046 || 504 ~| | 167 | | 330 | | B.0e-30 



Description 

pir: [LN:C64897] [AC:C64897 ] [PN:probable phosphinothricin N-acetyltransf erase , ] 
[CL:phosphinothricin N-acetyltransf erase] [OR : Escherichia coli] [EC:2.3.1.-] [DB:pir2] 

>gp: [GI:gl787719] [LN : AE000241] [AC : AE000241 :U00096 ] [PN:putative resistance protein] 
[GN:bl448] [FN:putative transport; Drug/analog sensitivity] [OR : Escherichia coli] 
[DB:genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 131 of 400 of the 

completegenome. ] [NT:fl72; 38 pet identical (3 gaps) to 169 residues of] [LE:9570] 
[RE : 10088] [DI : complement] 
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NT AA 

ORF Name NTID AAID Leng^Lenath score probability 

|AI7!>030009flb_b2i)0feVb_cJ_210a | [ 12 7 5 | [ 5047 || 405 1 1134 |pi~| p.le-Ja — | 

Description 

sp: [LN:PAND_BACSU] [AC:P52999] [GN : PAND] [OR : BACILLUS SUBTILIS] [EC : 4 . 1 . 1 . 11] 
[DE : DECARBOXYLASE ) ] [SP:P52999] [DB : swissprot] >pir : [LN : A69672 ] [AC:A69672 ] 
[PNiaspartate 1- decarboxylase panD] [GN: panD ] [CL :aspartate 1 -decarboxylase] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gll46242] [LN : BACYPIA] [AC:L47709] 
[PNraspartate 1- decarboxylase] [GN :panD] [FN: pantothenic acid biosynthesis] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [EC : 4 . 1 . 1 . 11] [DE:Bacillus subtilis (clone 
YAC15-6B) ypiABF genes, qcrABC genes , ypjABCDEFGHI genes, birA gene, panBCD genes, dinG 
gene, ypmB gene,aspB gene, asnS gene, dnaD gene, nth gene and ypoC gene, 
completecds ' s . ] [NT: 48. 3% of identity to the Escherichia coli aspartate] [LE: 14990] 
[RE:15373] [DI: direct] >gp : [GI :ell83686:g2 634659] [LN:BSUB0012] [AC : Z99115 : AL009126] 
[PN:aspartate 1-decarboxylase] [GN:panD] [FN pantothenate biosynthesis] [OR:Bacillus 
subtilis] [DB : genpept-bctl] [EC : 4 . 1 . 1 . 11] [DE: Bacillus subtilis complete genome 
(section 12 of 21): from 2195541to 2409220.] [SP:P52999] [LE:156294] [RE:156677] 
[DI complement] 

NT AA 

ORF Name NTID AAID — , — , , score probab ility 
LengthLength c L 



AI7g03000Sfl6_Sn4077_ci_i448 | [1276 [ [ 5048 ||IggT]| 630 | pnF]| i . 2e-i04 



Description 

sp: [LN:YIDK_ECOLI] [AC:P31448] [GN : YIDK] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 62.1 KD 
PROTEIN IN EMRD-GLVG INTERGENIC REGION] [SP:P31448] [DB : swissprot] >pir: [LN:H65169] 
[AC:H65169 ] [PN : hypothetical 62.1 kD protein in ilvo-ibpb intergenic region] [GN:yidK 
] [OR: Escherichia coli] [DB:pir2] >gp : [GI :g290528] [LN:ECOUW82] [AC:L10328] [GN:f571] 
[FN:unknown] [OR : Escherichia coli] [SR Escherichia coli K12 strain MG1655; lambda 
clones EC14-52] [DB : genpept-bctl] [DE : E . coli; the region from 81.5 to 84.5 minutes.] 
[NT: similar to glucose transport proteins] [LE: 47795] [RE: 49510] [DI : complement] 
>gp: [GI:gl790113] [LN: AE000445] [AC : AE000445 :U00096] [PN:putative cotransporter] 
[GN:yidK] [FN:putative transport; Not classified] [OR : Escherichia coli] 
[DB:genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 335 of 400 of the 
completegenome . ] [NT:f571; 100 pet identical to YIDK_ECOLI SW:] [LE:5576] [RE:7291] 
[DI : complement] 

_ — — — — ^ — .iii..,,.,,,.,,,.,,,— , ii..,.,.,,.,;;,,..,— ,11. .1 ■ I .1111,1. — -a =E MJ- |.»M .11 M l— - >~^r_^. — — — — ,- _ ^» , . .. , ,. „„ 

NT AA 

ORF Name NTID AAID — — , , score probabil ity 
LengthLength 



AI750300098B_534i3_r2_855 | [1277 | [ 5049 ||TFuT]| 533 | p3~ || 8 . 7e-40 



Description 

pir: [LN:H64532] [AC:H64532 ] [PN : 2 ' , 3 1 -cyclic-nucleotide 2 ■ -phosphodiesterase , 
precursor] [GN:HP0104 ] [CL : 2 ' , 3 ' -cyclic-nucleotide 2 ' -phosphodiesterase : 
2 ', 3 ' -cyclic-nucleotide 2 ' -phosphodiesterase homology :phosphoesterase core homology] 
[OR Helicobacter pylori] [EC : 3 . 1 . 4 . 16] [DB:pirl] >gp : [GI :g2313187] [LN: AE000532] 
[AC:AE000532 :AE000511] [PN:2 ' , 3 1 -cyclic-nucleotide 2 1 -phosphodiesterase] [GN:HP0104] 
[OR : Helicobacter pylori 26695] [DB : genpept -bet 2 ] [DE Helicobacter pylori 26695 section 
10 of 134 of the complete genome.] [NT:similar to GB:L42023 SP:P44764 PID:1004048] 
[LE : 3594] [RE : 5339] [DI : complement] 
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ORF Name 



NT ID 



AAID 



NT 



AA 



Length Length 



score probability 



lA17b0300098b bi69212 c3 2196 



[5050 |p^TS~| [502 ||430 ||5.4e-40 



Description 

sp: [LN:TAGE_BACSU] [AC:P13484] [GN : TAGE : RODD : GTAA] [OR:BACILLUS SUBTILIS] [EC : 2 . 4 . 1 . 52 ] 
[DE:(EC 2.4.1.52) (TEICHOIC ACID BIOSYNTHESIS PROTEIN E) ] [SP:P13484] [DB : swissprot ] 
>pir : [LN:S06048] [AC : S06048 : F6972 0 ] [PN : poly (glycerol -phosphate) 
alpha-glucosyltransf erase, tagE : probable rodD protein :UDP-glucose- -polyglycerol 
phosphate glucosyltransf erase tagE] [GN : tagE : rodD ] [OR: Bacillus subtilis] [EC: 
2.4.1.52] [DB:pir2] [MP:310 degrees ] >gp : [GI :g580920] [LN: BSRODC] [AC:X15200] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis rodC operon.] [NT : rodD 
(gtaA) polypeptide (AA 1-673)] [SP:P13484] [LE:157] [RE:2178] [DI:direct] 
>gp: [GI:ell84479:g2636099] [LN : BSUB0019] [AC : Z99122 : AL009126] 

[PN:UDP-glucose : polyglycerol phosphate] [GN:tagE] [FN:teichoic acid biosynthesis] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC: 2 . 4 . 1 . 52] [DE:Bacillus subtilis complete 

genome (section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodD, 

gtaA, gtaD] [SP:P13484] [LE : 80369] [RE: 82390] [DI : complement] 

>gp: [GI:ell84479:g2636099] [LN : BSUB0019] [AC : Z99122 : AL009126] 

[PN:UDP-glucose : polyglycerol phosphate] [GN:tagE] [FN:teichoic acid biosynthesis] 
[OR:Bacillus subtilis] [DB:genpept] [EC : 2 . 4 . 1 . 52] [DE:Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodD, gtaA, gtaD] 
[SP:P13484] [LE:80369] [RE:82390] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



|A17b03000985 54207 cl 1347 



] [1279 | [5051 ||138-| [45 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — _ T — , , score 
LengthLength 



probability 



lAI75030009«b b6693 cl 1418 



] [1280 | [5052 |[7?2"^|263 | [T7^~||2 . 7e-77 



Description 

gp: [GI:g2058476] [LN:BTU71200] [AC:U71200] [PN:acetoin reductase] [OR:Bos taurus] 
[DB:genpept-mam] [DE:Bos taurus acetoin reductase mRNA, complete cds . ] [NT:similar to 
acetoin reductase of Klebsiella] [LE:51] [RE: 824] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ T — , . score 
LengthLength 



probability 



|AI750300098b b78162 13 1226 



Description 
NO-HIT 



] eid F^n mp i 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IA17503000985 S867262 cl 1636 



[5054 |p^~| [41 | 



Description 
NO-HIT 



ORF Name 

|AI7503000385_58910 7 b_t2J 7 36 

Description 
NO-HIT 



NTID 



TT8T 



NT AA 

AAID — _ — , . score proba bility 
LengthLength c JL 



5TT5F" 



TJT 



334 



ORF Name 



lAl7b03000985 5907177 tl 392 



NT AA 
NT ID AAID — . _ — score 
Length Length 

[1284 | [5056 ||138 ||45 | 



probability 



Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



[AI7503000985_59i5653_t2_ 7 89 



Description 
NO-HIT 



probability 



1255 | [5057 |pi~| p~ 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000985 595250 c2 1956 



[5058 ||378TT| [1259 |p568||0.0 



Description 

gp: [GI:el429593:g4756151] [LN:A67161] [AC:A67161] [FN:NARG GENE] [OR : Staphylococcus 
carnosus] [DB :genpept-pat] [DErSequence 1 from Patent EP0805205.] [NT:unnamed protein 
product] [LE:4140] [RE:7814] [DI:direct] >gp : [GI :g3929522] [LN : AF02 9224] 
[AC:AF029224:AF029225] [PN:NarG] [GN:narG] [OR : Staphylococcus carnosus] 
[DB:genpept-bct2] [DE : Staphylococcus carnosus nir and nar operons, complete sequences. 



] 



[NT: similar to Escherichia coli nitrate reductases NRA] [LE:6325] [RE: 9999] [DI: direct] 



ORF Name 



NTID 



AAID 



E 



17503000985 5953300 12 895 



NT AA 
LengthLength 



score 



TTF" 



probability 
5.6e-06 



Description 

gp : [GI : dl044460 :g5105361] [LN: AP000062] [AC : AP000062] [PN:213aa long hypothetical 
protein] [GN:APE1673] [OR : Aeropyrum pernix] [SR : Aeropyrum pernix ( strain :K1) DNA] 

[DBrgenpept] [DE : Aeropyrum pernix genomic DNA, section 5/7.] [LE: 71354] [RE: 71995] 

[DI :direct] 



ORF Name 



NTID 



NT 



AA 



AAID , , 

LengthLength 



score 



probability 



E 



17503000985 5988785 tl 391 



Description 
NO-HIT 



J [1288 | [5060 ||135 1 144 | 



ORF Name 



NT AA 
NTID AAID — _ , — _ score 
LengthLength 



probability 



IAI7503000 985 6047827 c2 1778 



] [1289 | [5061 ||705 ~||234 ||569 ||3.8e-55 



Description 

gp: [GI :g2565161] [LN : LLU92974] [AC : U92974 :M90760 :M90761] [PN : AldB] [GN: aldB] 
[OR:Lactococcus lactis] [DB : genpept-bctl] [DE : Lactococcus lactis unknown gene, partial 
cds, and HisC (hisC) , unknown, HisG (hisG) , unknown, HisB (hisB) , unknown, HisH 
(hish),HisA (hisA) , HisF (hisF) , HisIE (hisIE) , unknown, unknown, LeuA(leuA) , LeuB 
(leuB) , LeuC (leuC) , LeuD (leuD) , unknown, IlvD(ilvD), IlvB (ilvB) , IlvN, IlvC (ilvC) , 
IlvA (ilvA) , AldB (aldB) and aldR (aldR) genes, complete cds.] [NT:alpha-acetolactate 
decarboxylase] [LE:24488] [RE:25198] [DI:direct] >gp : [GI : gl699352] [LN:S82499] 
[AC:S82499] [PN : alpha-acetolactate decarboxylase] [GN: aldB] [OR: Lactococcus lactis] 
[SR: Lactococcus lactis NCD02118 ssp. lactis] [DB :genpept-bctl] 

[DE:aldB=alpha-acetolactate decarboxylase [Lactococcus lactis, ssp. lactis, NCD02118, 
Genomic, 840 nt].] [NT : This sequence comes from Fig. 2; AldB] [LE:98] [RE:808] 
[DI: direct] 



335 



ORF Name 



NT 



AA 



^ Le - thLe ~ th score probability 



LL7503000985 60709J8 el 1477 



] [1290 | [5062 ||1773||590 | p7B"||4 . Oe-257 



Description 

gp: [GI :g4574121] [LN : AF009415] [AC : AF009415] [PN:choline dehydrogenase] [GNicudB] 
[OR: Staphylococcus xylosus] [DB : genpept-bct2] [DE : Staphylococcus xylosus choline 
transporter (cudT) , putativeregulatory protein (cudC) , glycine betaine aldehyde 
dehydrogenase (cudA) , and choline dehydrogenase (cudB) genes, complete cds.] [NT : CudB] 
[LE:4915] [RE:6597] [DIrdirect] 



ORF Name 



O17503000985 6101063 c2 1925 



Description 
NO-HIT 



ORF Name 



E 



17503000985 6101542 r2 836 



Description 
NO-HIT 



NT AA 
NTID AAID — — ^ score 
LengthLength 



probability 



] [1291 | [5063 |pg—| p 



NT AA 
NTID AAID — _^ — , - score 
LengthLength 



probability 



132 IPO" 



ORF Name 



1AI7503000985 6132893 c3 2259 



NTID 



n ^„ r NT AA 

AAID , , 

LengthLength 



score 



probability 



T5T 



Description 
NO-HIT 

ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



AI7503000985 6136050 c2 1807 



][ 



[5066 |pT5~||81 ||80 ||0.0033 



Description 

gp: [GI:g4980536] [LN : AE001692] [AC : AE001692 : AE000512] [PN: iron (II) transport protein A] 
[GN:TM0050] [OR : Thermo toga maritima] [DB : genpept - be t 2 ] [DE : Thermo toga maritima section 

4 of 136 of the complete genome.] [NT:similar to SP:P33649 PID:414746 PID:606343] 
[LE:81] [RE: 545] [DI:direct] 



ORF Name 



NTID 



NT AA 
AAID — — . , score 
LengthLength 



^17503000985 6136527 cl 1463 



][ 



fT2U!T 



[5067 |pT5 — 1 1204 |p3" 



probability 
12 .4e-60 



Description 

pir: [LN:C69877] [AC:C69877 ] [PN : adenylylsulf ate kinase homolog ylnC] [GN:ylnC ] 
[CL:adenylylsulfate kinase : adenylylsulf ate kinase homology] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI:e332186:g2462959] [LN : BSPYREYLO] [AC : AJ000974] [PN:putative adenosine 
5-phosphosulfate kinase] [GN:ylnC] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis pyrE to yloA gene region.] [LE:3535] [RE:4128] [DI:direct] 
>gp: [GI:ell85152:g2633933] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:ylnC] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to adenylylsulf ate kinase] [LE:34348] 
[RE:34941] [DI :direct] 



ORF Name 



IAI7503000985 6251592 c2 1940 



Description 
NO-HIT 



NTID AAID — — score probability 



NT AA 
Length Length : 



336 



NT AA 

ORFName NTID AAID Le ~ thLe ~ th gi^ probability 



A17i>0300058b_fe2«90fe^c2_1750 | [ 1297 | [ 5069 p |p jj || 1 . Oe-IB ~] 



Description 

gp: [GI:g4574118] [LN: AF009415] [AC : AF009415] [PN:choline transporter] [GN: cudT] 
[OR: Staphylococcus xylosus] [DB : genpept-bct2] [DE : Staphylococcus xylosus choline 
transporter (cudT) , putativeregulatory protein (cudC) , glycine betaine aldehyde 
dehydrogenase (cudA) , and choline dehydrogenase (cudB) genes, complete cds.] [NT : CudT] 
[LE:811] [RE:2433] [Dlidirect] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



p7503Q00985_g5g9588_i2_5g^ | [1298 | [5070 || 9iS ~| | 304 | p^u~| |^ . 9e-05 

Description 

gp: [GI:g4262225] [LN: ATAC0062 00] [AC : AC006200] [PN:putative phosphatidic acid 
phosphatase] [GN:F10A8.6] [OR : Arabidopsis thaliana] [SRtthale cress] [DB : genpept-pln2] 

[DE:Arabidopsis thaliana chromosome II BAC F10A8 genomic sequence, complete sequence.] 

[LE : 16778] [RE : 17686] [DI : complement] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000985_5442202_i3_lQ33 | |1299 | [5071 |puTu"| |349 ||3i0 ||rTT7e^T7 



Description 

Sp : [LN : YYAD_BACSU] [AC:P37520] [GN : YYAD] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 37.7 
KD PROTEIN IN RPSF-SPOOJ INTERGENIC REGION] [SP:P37520] [DB : swissprot] >pir : [LN : 140448] 
[AC: 140448 :S66018 :C70084 :S18084 ] [PN : conserved hypothetical protein yyaD (replication 
origin region)] [GN:yyaD ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI :dl005766 :g467378] 
[LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 
kilobase region of replication origin.] [LE: 51420] [RE: 52436] [DI : complement] 
>gp: [GI:g580907] [LN:BSORIGS] [AC:X62539] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis genes rpmH, rnpA, 50kd, gidA and gidB.] [NT: unnamed protein product] 
[SP:P37520] [LE:10964] [RE:11980] [DI:direct] >gp : [GI : ell8482 0 : g2636641] [LN : BSUB002 1] 
[AC:Z99124 :AL009126] [GN:yyaD] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 21 of 21): from 3999281to 4214814.] 
[NT:similar to hypothetical proteins from B. subtilis] [SP:P37520] [LE:203352] 
[RE: 204368] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ T — , , score probabili ty 
LengthLength c *- 



6.7e-33 



|AI7B030009Bb_6S1627_cljL369 | [1300 | [ 5072 || 732 — 1 | 243 

Description 

sp: [LN:HIS4_SYNY3] [AC:P74561] [GN : HISA : SLR06 52] [OR : SYNECHOCYSTIS SP] [SR:PCC 6803,] 
[EC: 5. 3. 1.16] [DE : ISOMERASE, ] [SP:P74561] [DB : swissprot] >pir : [LN : S76 756] [AC:S76756 ] 
[PN: hypothetical protein] [CL:N- (5 ' -phospho-D-ribosylf ormimino) -5-amino-l- 
(5 1 ' -phosphor ibosyl) -4- imidazolecarboxamide isomerase] [OR : Synechocystis sp.] [SR:PCC 
6803, , PCC 6803] [SR:PCC6803, ] [DB:pir2] >gp : [GI : dl019401 : gl653757] [LN:D90916] 
[AC:D90916 :AB0013 39] [PN :phosphorybosilf ormimino- 5 -amino-] [GN: hisA] [OR : Synechocystis 
sp.] [SR: Synechocystis sp. (strain: PCC6803) DNA] [DB : genpept-bctl] [DE : Synechocystis 
sp. PCC6803 complete genome, 26/27, 3270710-3418851.] [NT : ORF_ID : slr0652] [LE:48572] 
[RE:49342] [DI:direct] 

NT AA 

ORF Name NTID AAID — _ — , . score probabili ty 



AI750300098!i_6522252_c2_i805 

Description 

NO -HIT 



LengthLength - 
[1301 | [5073 [|TT8~1|4g | 



337 



ORF Name 



F 



1750300098b 6642792 ti 978 



Description 
NO-HIT 



NT AA 
NT ID AAID ^ — „ — score 
LenqthLength 



probability 



[1302 | [5074 ||147— |p- 



ORF Name 



117503000385 6662875 hi 382 



Description 
NO-HIT 



NTID 



11303 



„ T „ NT AA 

AAID — , — , , score 

LengthLength 



probability 



ITT 



T8" 



ORF Name 



|Al750300098b 6678507 tl 253 



Description 
NO-HIT 



ORF Name 



IAI7503000985 5745327 ±3 1336 



Description 
NO-HIT 

ORF Name 



IAI7503000985 5772250 cl 1380 



Description 
NO -HIT 



ORF Name 



U750300098b 6829635 c2 1695 



Description 
NO-HIT 



NT AA 

NTID AAID Le ^ thLe ~ th score probability 



] [1304 | [5076 |pT7~ ||68 | 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



[1305 | [5077 |[T?3~ ||47 | 



NTID AAID 



NT 



AA 



B7771T 



— , v_ T — , , score probabili ty 
LengthLength : — • L - 

WE — 



TTT" 



NT AA 

NTID AAID — — . , score proba bility 
LengthLength c 



"j [1307 | [ 5079 |p^~|| 43 | 



ORF Name 



NTID 



— . _ NT AA 
AAID — — , score 
Length Length 



probability 



IAI7503000985 6829812 cl 1542 



] [1308 | [5080 1|315 | p^B — 1|4 . 2e-79 



Description 

gp: [GI:g4959404] [LN : AF1153 91] [AC : AF115391] [PN : ribokinase RbsK] [GNrrbsK] 
[OR: Lactobacillus sakei] [DB :genpept-bct2] [DE : Lactobacillus sakei LaaA (laaA) gene, 
partial cds; LaaB (laaB) , putative acetate kinase AckA (ackA) , LaaC (laaC) genes, 
completecds; rbs operon, complete sequence; and LaaE (laaE) gene, partialcds.] 
[LE:4450] [RE:5358] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IAI7503000985 6837938 tl 376 



] [1309 | [5081 |p5gT]|1186 ||198 ||2.1e-14 



Description 

sp: [LN:CDR1_HUMAN] [AC:P51861] [GNrCDRl] [OR: HOMO SAPIENS] [SR:, HUMAN] 
[DE : CEREBELLAR -DEGENERATION- RELATED ANTIGEN 1 (CDR34 ) ] [SP:P51861] [DB : swissprot] 
>pir: [LN:A29770] [AC :A2 9770: A3 5640 ] [PN : cerebellar degeneration-related protein] 
[GN : CDR1 : CDR ] [OR: Homo sapiens] [SR:, man] [DB:pir2] [MP : Xq2 7.1- Xq2 7 . 2 ] 



338 



NT AA 

ORFJJame NTID AAID Le ~ thLe ~ th probability 



|AJ.7b0a0009flb>044012_c2_174B | [1310 | [ 5082 || 1344 j | 447 || t>B9 | | 2.9e-b7 

Description 

sp: [LN:DCUA_ECOLI] [AC:P04539] [GN : DCUA : GENA] [OR : ESCHERICHIA COLI] [DE : ANAEROBIC 
C4 - D I CARBOX YLATE TRANSPORTER DCUA] [SP:P04539] [DB : swissprot] >pir: [LN:QQEC94] 

[AC:S56366:S57340:H65223:A04471:S08589 ] [PN : dicarboxylate membrane- transporter protein 
A: anaerobic c4 -dicarboxylate transporter dcua : aspartase membrane transport protein 
genA] [GN:dcuA:genA ] [CL : dicarboxylate membrane -transporter protein A] [OR : Escherichia 
coli] [DBrpirl] [MP:94min] >gp : [GI :g510888] [LN: ECDUCA] [AC:X79887] [PN : dicarboxylate 
membrane- transporter protein] [GN : dcuB] [OR : Escherichia coli] [DB :genpept-bctl] 

[DE :E. col i dcuA gene.] [SP:P04539] [LE:129] [RE:1430] [DIrdirect] >gp : [GI:g536982] 

[LN:ECOUW93] [AC:U14003] [GN:genA] [FN:membrane transport of aspartase] [OR: Escherichia 
coli] [DB:genpept-bctl] [DE : Escherichia coli K-12 chromosomal region from 92.8 to 00.1 
minutes.] [LE:56300] [RE:57601] [DI : complement] >gp : [GI : gl790580] [LN:AE000486] 

[AC:AE000486 :U00096] [PN: anaerobic dicarboxylate transport] [GN : dcuA] [FN : transport ; 
Transport of small molecules:] [OR : Escherichia coli] [DB : genpept-bct2] [DE : Escherichia 
coli K-12 MG1655 section 376 of 400 of the completegenome . ] [NT:f433; 100 pet identical 
amino acid sequence and] [LE:6845] [RE: 8146] [DI : complement] 

NT AA 

ORE Name NTID AAID Le ~ thLe ~ th score probability 



Ai:76030005aS_6a99133_ti_4iS | |i311 | [ 5083 | |T?r~| |4i || 75 | | 0.038 



Description 

gp: [GI:g5306168] [LN : AF160864] [AC : AF160864] [PN:orf256] [GN:orf256] [OR : Mitochondrion 
Tetrahymena pyriformis] [SR : Tetrahymena pyriformis] [DB:genpept] [DE : Tetrahymena 
pyriformis mitochondrial DNA, complete genome.] [NT:Open reading frame ymf62 (CPGN) ] 
[LE:40866] [RE:41636] [DIrdirect] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



A17503000985_692B_ci_ib46 | [1312 | [5084 |[MT~| [ 280 | 



Description 
NO-HIT 



339 



ORF Name 



NT 



AA 



™ ^ LengthLength probability 



lAI750300098b 70J90S1 c3 2238 



TTTT" 



[5085 ||75(r~||249 [|585 [|7.6e-57 



Description 

sp: [LN:YCKJ_BACSU] [AC:P42200] [GN : YCKJ] [OR:BACILLUS SUBTILIS] [DE: PROBABLE AMINO -AC ID 
ABC TRANSPORTER PERMEASE PROTEIN] [SP:P42200] [DB : swissprot ] >pir : [LN : 140451] 
[AC: 140451 : D69761 : S523 82 ] [PN : glutamine ABC transporter (permease) homolog 
yckJ: ABC- type transport system probable membrane spanning protein] [GN:yckJ ] 
[CLrhistidine permease protein M] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :g666982] 
[LN : BSPAAT] [AC:X77636] [PN:putative membrane spanning subunit] [GN:ORF2] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE : B . subtilis putative amino acid transporter gene.] 
[NT:potential ABC- transport system] [SP:P42200] [LE:909] [RE:1613] [DI:direct] 
>gp: [GI :ell82312 :g2632646] [LN : BSUB0002 ] [AC : Z99105 : AL009126] [GN:yckJ] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis complete genome (section 
2 of 21): from 194651 to4l5810.] [NT:similar to glutamine ABC transporter (permease)] 
[SP:P42200] [LE:214896] [RE:215600] [D I : complement] >gp : [GI :ell82327:g2 632661] 
[LN:BSUB0003] [AC : Z99106 : AL009126] [GN:yckJ] [FN:unknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 3 of 21) : from 402751 
to611850.] [NT: similar to glutamine ABC transporter (permease)] [SP:P42200] [LE:6796] 
[RE:7500] [DI : complement] >gp: [GI : dl009628 : gl8 05431] [LN:D50453] [AC:D50453] 
[PN:homologue of glutamine transport system] [GN:yckJ] [OR: Bacillus subtilis] 
[SR:Bacillus subtilis (strain:168 trpC2) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis 
DNA for 25-36 degree region containing theamyE-srfA region, complete cds.] [LE : 89199] 
[RE: 8 9903] [DI : complement ] 



ORF Name 



NT AA 
NTID AAID — _ T — _ sc ore 
LengthLength 



probability 



U7503000985 7227175 c3 2052 



3 | 1314 | |5086 |fT55 | [ST 



Description 
NO-HIT 



ORF Name 



NTID AAID 



IAI7503000985 7228517 c3 2197 



NT AA 
LengthLength 
] | P 87 ||4il | \TJZ 



score 



probability 
|4.3e-15 



Description 

pir: [LN:A70556] [AC:A70556 ] [PN:probable mutator MutT protein or homolog] [GN:mutT2 ] 
[CL:mutT domain homology] [OR Mycobacterium tuberculosis] [DB:pir2] 

>gp: [GI:e317133:g2117198] [LN:MTCI65] [AC : Z95584 : AL123456] [PN:mutT2] [GN : mutT2] 
[OR : Mycobacterium tuberculosis] [DB : genpept-bctl] [DE : Mycobacterium tuberculosis H37Rv 
complete genome; segment 50/162.] [NT:Rvll60, (MTCI65.27), mutT, len: 141. Probable] 
[LE:24794] [RE:25219] [DI:direct] 



ORF Name 



IAI7503000985 7234627 ci 1580 



Description 
NO-HIT 



NTID 



7v7vtt ^ NT AA 

AAID — , — , . score 

LengthLength 



probability 



] [1315 | [5088 |pTu~ ip | 



ORF Name 

IAI7503000985 7242128 c2 1925 



NTID 
11317 



NT AA 
AAID — — 

LengthLength 

[5089 j rTCB I 



score 



fTTT" 



probability 
|3.7e-32 



Description 

gp: [GI:e314909:g2073521] [LN : SSK1MECA] [AC:Y09223] [PN : hypothetical protein] 
[OR staphylococcus sciuri] [DB : genpept-bctl] [DE:S.sciuri mecA gene & ORF's 450, 
179.] [NT:ORF450] [LE:<1] [RE: 1351] [DI : complement ] 



145 & 



340 



ORF Name 



NT 



AA 



NTID AAIP Le -- thLe - th g^ probability 



U.7b0300098b c2 1644 



j [1318 | [5090 |[T7?2~j|573 ~~||^M 4 p.4e-57 



Description 

gp: [GI:dl044599:g5105500] [LN : AP000062] [AC : AP000062] [PN:558aa long hypothetical 
protein] [GN:APE1810] [OR: Aeropyrum pernix] [SR : Aeropyrum pernix (strain: Kl) DNA] 
[DB:genpept] [DE : Aeropyrum pernix genomic DNA, section 5/7.] [NT : mot if =ATP/GTP- binding 
site motif A (P-loop)] [LE:155809] [RE:157485] [DI:direct] 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



IAI7S0300098S 7245377 ci 1534 



TJTT 



[5091 ||i0gT]|333 | [ 



probability 
3.4e-22 



Description 

sp: [LN:EST_ACICA] [AC:P18773] [GN : EST] [OR : ACINETOBACTER CALCOACETICUS] [EC:3.1.1.-] 
[DE : ESTERASE, ] [SP:P18773] [DB : swissprot] >gp : [GI:g303953] [LN: ACCESTERAS] 
[AC :M24890 :L20754] [PN : esterase] [OR : Acinetobacter calcoaceticus] [SR : Acinetobacter 
calcoaceticus ssp. lwoffii (strain RAG-1) DNA] [DB :genpept-bctl] [DE : Acinetobacter 
calcoaceticus esterase gene, complete cds.] [LE:1] [RE: 912] [DI:direct] 



ORF Name 



i!750300098b 7301078 c3 2006 



NT AA 
NTID AAID — _ — _ score 
LengthLength 



probability 



1320 | |b092 |p^~||154 |[340 ||7.0e-3i 



Description 

gp: [GI:g3114664] [LN : AF061267] [AC : AF061267] [PN: ATPase component HtxD] [GN : htxD] 
[OR : Pseudomonas stutzeri] [DB :genpept-bct2] [DE : Pseudomonas stutzeri putative 
alpha-ketoglutarate-dependenthypophosphite dioxygenase (htxA) , binding protein 
component HtxB(htxB), inner membrane component HtxC (htxC) , ATPase component 
HtxD(htxD), inner membrane component HtxE (htxE) , putative C-P lyasesubunits HtxF 
(htxF) , HtxG (htxG) , and HtxH (htxH) genes, completecds ; and putative C-P lyase subunit 
Htxl (htxl) gene, partial cds.] [NT:putative binding-protein-dependent hypophosphite] 
[LE:3996] [RE:5021] [DI: direct] 



ORF Name 



|AI750300098b 78337b t2 893 



Description 
NO-HIT 



NT AA 
NTID AAID — . w — , _ score 
Length Length 



probability 



J [1321 | |b093 |[TTr~||48 | 



ORF Name 



E 



1750300098b 784387 c3 1970 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



TTZT 



|b094 |p^F]|474 |[731~||i.3e-77 



Description 

pir: [LN:G71641] [AC:G71641 ] [PN: dihydrolipoamide dehydrogenase (pdhD) RP805] 
[GN:pdhD:RP805 ] [CL : dihydrolipoamide dehydrogenase : dihydrolipoamide dehydrogenase 

homology] [OR : Rickettsia prowazekii] [DB:pir2] >gp : [GI : el343076 : g3861332] [LN:RPXX04] 
[AC:AJ235273 :AJ235269] [PN : DIHYDROLIPOAMIDE DEHYDROGENASE (pdhD) ] [GN:RP805] 
[OR Rickettsia prowazekii] [DB : genpept-bctl] [DE Rickettsia prowazekii strain Madrid E, 

complete genome; segment4/4.] [LE:137655] [RE:139034] [DI : complement] 



341 



ORF Name 



NT 



AA 



^ AAIP Le - thLe - th ^e probability 



IAI7503000985 78V6V7 f2 636 



1323 | [5095 ||483 1 1160 ||246 ||5.4e-21 



Description 

gp: [GI :g2735506] [LN : SCU96107] [AC:U96107] [PNrSceB precursor] [GNisceB] 
[OR: Staphylococcus carnosus] [DB .-genpept -be t2] [DE : Staphylococcus carnosus 
N5,N10-methylenetetrahydromethanopterinreductase homolog, SceB precursor (sceB) and 
putative transmembraneprotein genes, complete cds, and putative Na+/H+ antiporter 
NhaC(nhaC) gene, partial cds.] [NT:major secreted protein] [LE:1894] [RE:2685] 
[DI : direct] 



ORF Name 



NT AA 
NT ID AAID — _ „ — ^ score 
LerigthLength 



IAI7503000985 798838 c3 1587 



][ 



probability 
[5095 |p^TT[|4i9 ||S45 ||2.ie-84 



Description 

pir: [LN:E70368] [AC:E70368 ] [PN : histidinol dehydrogenase] [GNrhisD ] [CL : histidinol 
dehydrogenase : histidinol dehydrogenase homology] [ORrAquifex aeolicus] [DB:pir2] 
>gp: [GI:g2983343] [LN: AE000707] [AC : AE000707 : AE000657] [PN : histidinol dehydrogenase] 
[GN: hisD] [OR:Aquifex aeolicus] [DB : genpept -bct2 ] [DE:Aquifex aeolicus section 39 of 
109 of the complete genome.] [LE:5716] [RE: 6996] [DI : complement] 



ORF Name 



NT ID 



E 



17503000985 823518 tl 169 



Description 
NO-HIT 



][ 



TJ7T 



NT AA 

AAID , , 

LengthLength 

|5097 j[z?5 [ rr$ 



score probability 



ORF Name 



NT ID 



„ T „ NT AA 

AAID — , T — , , score 

LengthLength 



probability 



IA17503000985 832561 c3 2102 



1326 j [5098 ||TT3 1 



Description 
NO-HIT 



ORF Name 



NT ID 



AAID 



IAI7503000985 86088 f2 835 



TTZT 



NT AA 
LengthLength 



score 



T7T 



T5T" 



probability 
3.6e-ll 



Description 

gp: [GI:g2735504] [LN : SCU96107] [AC:U96107] [OR : Staphylococcus carnosus] 
[DB:genpept-bct2] [DE : Staphylococcus carnosus 

N5,N10-methylenetetrahydromethanopterinreductase homolog, SceB precursor (sceB) and 
putative transmembraneprotein genes, complete cds, and putative Na+/H+ antiporter 
NhaC(nhaC) gene, partial cds.] [NT:Orfl] [LE:<1] [RE: 185] [DI : complement] 



ORF Name 



NT 



AA 



NT ID AAID — , _ — , score proba bility 
LengthLength c JL 



|AI7503000985_860917_t3J.170 



] [1328 | [5100 |pPT — 1 166 | [55 1|0.0095 



Description 

pir: [LN:A71605] [AC:A71605 ] [PN:probable integral membrane protein PFB0845w] 
[GN:PFB0845w ] [OR : Plasmodium falciparum] [DB:pir2] >gp : [GI : g3845291] [LN : AE001420] 
[AC:AE001420 :AE001362] [PN:predicted integral membrane protein] [GN: PFB0845w] 
[OR: Plasmodium falciparum] [SR:malaria parasite P. falciparum] [DB : genpept -inv2] 
[DE : Plasmodium falciparum chromosome 2, section 57 of 73 of thecomplete sequence.] 
[NT:predicted by GlimmerM] [LE:9688] [RE: 10611] [DI: direct] 



342 



ORF Name 



NT AA 

NTID AAID kQ^tfrkength score probability 



U7b0300098b UfcV^tbb tl 2ib 



"| [1329 | [5101 |p^~~[ |5Q [ 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 

^2 L .^ tbLe ^ th g°°^ probability 



E 



17503000985 892143 c2 1927 



5TuT" 



ST" 



Description 
NO-HIT 

ORF Name 



NT AA 

NTID AAID ^ — _^ — _ score probability 
LengthLength *■ 



F 



17503000985 891000 c2 1906 



Description 
NO-HIT 



"| [1331 | [5103 |[TTr~||48 | 



ORF Name 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



IAI7503000985 895253 c3 2019 



[5104 |pu~ ||159 |p3~ ||5.8e-27 



Description 

pir: [LN:E71960] [AC:E71960 ] [PN:probable peptide methionine sulfoxide reductase] 
[GN:jhp0210 ] [OR: Helicobacter pylori] [SR:strain J99, , strain J99] [SR:strain J99, ] 
[DB:pir2] >gp: [GI:g4154731] [LN: AE001459] [AC : AE001459 : AE00143 9] [PN:putative PEPTIDE 

METHIONINE SULFOXIDE REDUCTASE] [GN : j hp0210] [OR : Helicobacter pylori J99] 
[DB:genpept-bct2] [DE : Helicobacter pylori, strain J99 section 20 of 132 of the 

completegenome . ] [NTrsimilar to H. pylori 26695 gene HP0224] [LE:4882] [RE:5961] 
[DI: direct] 



ORF Name 



NTID AAID _ — 



F 



17503000985 901377 c2 1785 



TUT 



^TOT- 



NT AA 
Length Length 



score 



probability 



Description 
NO-HIT 

ORF Name 



NT AA 

NTID AAID — — L score 

LengthLength 



probability 



lAI750300098b_901377__t2_596 



Description 
NO-HIT 



"| [1334 | [5105 |pg~ ||42 | 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength c 

AI7503000985_953142_tl_2i9 1 [1335 | [ 5107 ||55g— 1 1 317 | p5~ || 5 . ie-19 | 

Description 

sp: [LN:YOBS_MYCTU] [AC:Q50648] [GN : MTCY227 . 28C] [OR : MYCOBACTERIUM TUBERCULOSIS] 
[DE: HYPOTHETICAL 26.2 KD PROTEIN CY227.28C] [SP:Q50648] [DB : swissprot] >pir : [LN:D70724] 
[AC:D70724 ] [PN : hypothetical protein Rv2573] [GN:Rv2573 ] [OR Mycobacterium 
tuberculosis] [DB:pir2] >gp: [GI : e256179 : gl478239] [LN:MTCY227] [AC : Z77724 : AL123456] 
[PN: hypothetical protein Rv2573] [GN:Rv2573] [OR : Mycobacterium tuberculosis] 
[DB:genpept-bctl] [DE Mycobacterium tuberculosis H37Rv complete genome; segment 
114/162.] [NT:Rv2573 / (MTCY22 7 . 28c) , len: 246. Unknown but some] [SP:Q50648] [LE:5598] 
[RE: 6338] [DI:direct] 



343 



ORF Name 



lA1 7 50300098b 9b39J0 F2 838 



Description 
NO-HIT 



NT 



AA 



NTID AAID Le - thLe - th g£gre probability 



] P 36 | | 510fl |P 1 



ORF Name 



[AI7503000985 961562 ci 1401 



Description 
NO-HIT 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 
\TTTT 



ORF Name 



vI750300098b 968800 c2 1796 



NTID AAID 



NT 



AA 



Len^thLen^th ^^ probability 



] | 1338 1 ||gP^ — 1 1267 |[TTrn-||4.2e-104 



Description 

gp: [GI:e244971:gl340128] [LN:SA1234] [AC:X97985] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE:S. aureus orfs 1,2,3 & 4.] [NT : ORF1] [LE:537] [RE:1304] 
[DI -.direct] 



ORF Name 



IM7503000985 9766375 tl 90 



Description 
NO-HIT 



NT AA 

NTID AAID — ., — . . score probability 
LengthLength c L - 



] |1339 | [5111 j |5T 



ORF Name 



E 



1750300098b 9770801 t2 621 



Description 
NO-HIT 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



1340 | jb!12 |[I38~1|4b | 



ORF Name 



|AI750300098b 978965 c3 2055 



Description 
NO-HIT 



NTID AAID 



NT AA 



LengthLength probability 



] [1341 | [5113 | pT3~|| 70 | 



ORF Name 



p75U 



3000985 986312 Tl 447 



Description 
NO-HIT 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



] [1342 | [5114 |[T55— 1|55 | 



344 



ORF Name 



NT AA 
NTIP AAID — » _ — score 
Length Length 



probability 



U.Vb0300096b tfUVbJJi el lbB2 



J [1343 | |5il5 j|T53T"| |510 ~j [1134 ||5 . le-liS 



Description 

sp: [LN:DHAL_VIBCH] [AC:P23240] [GNrALDA] [OR : VIBRIO CHOLERAE] [EC:1.2.1.3] [DE : ALDEHYDE 
DEHYDROGENASE J [SP:P23240] [DB : swissprot] >gp : [GI :gl55276] [LN : VIBTAGALDA] [AC : M60658] 
[PN: aldehyde dehydrogenase] [GNraldA] [OR: Vibrio cholerae] [SR:V. cholerae DNA] 
[DB:genpept-bctl] [EC: 1.2. 1.3] [DE: Vibrio cholerae aldehyde dehydrogenase gene, 
complete cds, and tagAgene, 5' end.] [LE:419] [RE:1939] [DI:direct] >gp : [GI : g3004925] 
[LN:AF034434] [AC : AF034434] [PN:aldehyde dehydrogenase] [GNraldA] [OR:Vibrio cholerae] 
[DB:genpept-bct2] [DE:Vibrio cholerae pathogenicity island, putative 
transposase, aldehyde dehydrogenase (aldA) , toxR- activated gene A protein ( tagA) , 
putative inner membrane protein, and putative zincmetalloprotease genes, complete cds; 
and toxR-activated gene Dprotein (tagD) gene, partial cds.] [NT: AldA] [LE:2388] 
[RE : 3908] [DI : complement] 



ORF Name 



IAI7503000985 9928200 tl 189 



Description 
NO-HIT 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



[1344 | |5116 Iflgg | j52T 



ORF Name 



E 



17503000985 9944635 t2 631 



Description 
NO-HIT 



NTID 



_ _ NT AA 

AAID — _ T — , , score 

LengthLength 



probability 



1345 | |bli7 ||44 | 



ORF Name 



lAI7b0300098b 995300 tl 28 



NTID 



™™ NT AA 

AAID — — , score 

LengthLength 



probability 



] I 1345 I | 511fl IP 7 "!! 67 IP 1|0-048 



Description 

gp: [GI:gll73895] [LN: PFU41075] [AC:U41075] [PN:p82] [GN:RAP-1] [OR : Plasmodium 
falciparum] [SR:malaria parasite strain=IndJ- 1 (clone 1 of Indian isolate J)] 
[DB:genpept-inv2] [DE : Plasmodium falciparum strain IndJ-1 rhoptry-associated protein 
l(RAP-l) gene, partial cds.] [NT : rhoptry-associated protein 1; localized to rhoptry] 
[LE:<1] [RE:>461] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ „ — _ score 
LengthLength 



probability 



|M!7503000985_59540i2_tl M 333 



Description 
NO-HIT 



1347— | [5119 |pn— |[4S | 



345 



NT AA 

ORF Name NTID AAID Le ^ thIie ^ t h score probability 
^lVb0^00098^10iiViOV_t2_121 | [1348 | [ 5120 | |26 7 1 1 88 | [8T | |0.016 | 

Description 

sp: [LN:TAGB_BACSU] [AC:P27621] [GN : TAGB] [OR : BACILLUS SUBTILIS] [DE:TEICHOIC ACID 
BIOSYNTHESIS PROTEIN B PRECURSOR] [SP:P27621] [DB : swissprot] >pir : [LN : C49757] 
[AC :C4 9757 :C6 972 0 ] [PN : polyglycerol phosphate techoic acid biosynthesis protein tagB ] 
[GN:tagB ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl43725] [LN : BACTAGABCD] 
[AC:M57497] [GN:tagB] [ORrBacillus subtilis] [SR : B . subtilis (strain 168) DNA] 
[DB:genpept-bctl] [DE :B . subtilis tagA, tagB, tagC and tagD genes, complete cds . ] 
[NTrputative] [LE:1603] [RE:2748] [DI:direct] >gp : [GI :ell84482 :g2 636102] [LN: BSUB0019] 
[AC : Z99122 : AL009126] [GN:tagB] [FN : polyglycerol phosphate assembly and export] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [SP:P27621] [LE:84143] [RE:85288] [DI:direct] 
>gp: [GI:ell84482 :g2636102] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN:tagB] 
[FN: polyglycerol phosphate assembly and export] [OR: Bacillus subtilis] [DB:genpept] 
[DE:Bacillus subtilis complete genome (section 19 of 21): from 3597091to 3809700.] 
[SP:P27621] [LE:84143] [RE:85288] [DI:direct] 

NT AA 

ORF Name NTID AAID — _ „ — ^ score probability 
Length Length c £ - 



IAI750300098S 10625525 hi 58 



1349 | |5i21 |pu^T]p40 ||8i0 ||i.ie-fl0 



Description 

pir: [LN:E69831] [AC:E69831 ] [PN : conserved hypothetical protein yhfP] [GN:yhfP ] 
[CL: Bacillus subtilis conserved hypothetical protein yhfP] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : el 183034 :g2 633368] [LN: BSUB0006] [AC : Z99109 : AL009126] [GN:yhfP] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 6 of 21): from 999501 tol209940.] [NT:similar to hypothetical proteins] 
[LE:106496] [RE:107488] [DI:direct] >gp : [GI : e324995 : g2226252] [LN: BSY14084] [AC:Y14084] 
[PN: hypothetical protein] [GN:yhfP] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE :B. subtilis chromosomal DNA, region 78-80 degrees: aprE to comK.] [NT:high 

similarity to yhdH from E.coli (Swiss Prot] [LE:924] [RE: 1916] [DI: direct] 



NT AA 

ORF Name NTID AAID — — , , score probabi lity 
LengthLength 



|AI7503000385_10976g2b_ci_226 | [1350 | [5122 |p^~|| 295 || 870 || 4.8e-87 



Description 

sp: [LN:YHXD_BACSU] [AC : P40398 : 007554] [GN : YHXD] [OR:BACILLUS SUBTILIS] [EC:1. -.-.-] 
[DE:(EC 1.-.-.-) (ORFY) ] [SP : P40398 : 007554] [DB : swissprot ] >pir : [LN : E69835] 
[AC:E69835:S43612 ] [PN:ribitol dehydrogenase homolog yhxD] [GN :yhxD ] [CL : short -chain 
alcohol dehydrogenase homology] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell83045:g2633379] [LN : BSUB0006] [AC : Z99109 : AL009126] [GN:yhxD] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
6 of 21) : from 999501 tol209940.] [NT: similar to ribitol dehydrogenase] [SP:P40398] 
[LE:117706] [RE:118605] [DI : complement] >gp: [GI : e324973 :g2226172] [LN: BSY14081] 
[AC:Y14081] [PN : hypothetical protein] [GN: yhxD] [OR: Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis chromosomal DNA, region 92 degrees: 
regionbetween comK and addAB.] [NT : bp 1-501 overlaps with bp 1525-1947 (end) from EMBL] 
[SP:P40398] [LE:1] [RE: 900] [D I : complement] 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength 

A17503000986_iill4677_i2J7:i | pm | |5123 |p^~| 



Description 
NO-HIT 



346 



ORF Name 



NT 



AA 



^ LeiithLeigth^ore probability 



| AlVbt)300098fc 117407Va t'2 80 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID AAID Le ^ thLe ^ th score probability 



E 



17503000986 1207250 C3 313 



TT5T" 



[5125 jpgP~||iS9 |p ||0.032 



Description 

sp: [LN:ATP6_APIME] [AC:Q00275] [GN : ATP6] [ORrAPIS MELLIFERA] [SR :, HONEYBEE] 
[EC: 3. 6. 1.34] [DE : ATP SYNTHASE A CHAIN, (PROTEIN 6)] [SP:Q00275] [DB : swissprot] 
>pir: [LN:A42622] [AC : A42622 : S52964 ] [PN:H+- transporting ATP synthase, chain 6] 
[CL:H+- transporting ATP synthase protein 6] [OR : mitochondrion Apis mellifera] [SR: , 
honeybee] [EC : 3 . 6 . 1 . 34] [DB:pir2] >gp : [GI :g552442] [LN: AMFGENOM] [AC:L06178] [PN : ATPase 
subunit 6] [OR : Mitochondrion Apis mellifera ligustica] [SR: common honeybee] 
[DB :genpept-invl] [DE:Apis mellifera ligustica complete mitochondrial genome.] 
[LE:4584] [RE:5264] [DI:direct] >gp : [GI :g552451] [LN: AMFMTATPAS] [AC:M87065] 
[PN:adenosine triphosphatase 6] [GN: ATPase6] [OR : Mitochondrion Apis mellifera 
ligustica] [SR : Mitochondrion Apis mellifera ligustica (organelle Mitochondrio] 
[DB :genpept-invl] [DE : Apis mellifera mitochondrial adenosine triphosphatase 8 
(ATPase8) gene, complete cds; adenosine triphosphatase 6 (ATPase6) gene , complete cds.] 
[LE:141] [RE:821] [DI:direct] 



ORF Name 



IA17503000986 1290703 cl 227 



Description 
NO-HIT 



ORF Name 



IAI7503000986 14644018 tl 39 



NTID 



II35T 



NT AA 
AAID — . _. — . , score 
Length Length 



probability 



[5125 |p?T 1 



NT AA 
NTID AAID — _ „ — , scor e 
LengthLength 



probability 



-J [1355 | [5127 |pST"~||322 ||324 ||3.4e-29 



Description 

gp: [GI:g2196513] [LN: SEU77778] [AC :U77778 :U29130] [PN:putative membrane protein] 
[GN : epiH] [FN: involved in epidermin secretion] [OR : Staphylococcus epidermidis] 
[DB:genpept-bctl] [DE : Staphylococcus epidermidis plasmid pTue32 putative ABC 
transportersubunits (epiG) , (epiE) , and (epiF) , putative membrane protein (epiH) , EpiT' 
(epiT') and EpiT* ' (epiT' 1 ) genes, complete cds . ] [NT : EpiH] [LE:2615] [RE:3607] 
[DI :direct] 



ORF Name 



E 



17503000985 14647750 c2 238 



Description 
NO-HIT 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



J [1355 | [5128 1|46 | 



ORF Name 



AI7503000986_14656952_t2_93 

Description 

NO-HIT 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



■] [1357 | [5129 |pi~ ||210 | 



347 



NT AA 

0RF Name *™ MID Le — thLe — th score probability 

|A17b030009^_lb0Jb9b2_t2_91 | [1355 | [5130 ||477 | |15fl | 

Description 
NO-HIT 



NT AA 

ORF Name NT ID AAID — — score probability 



IAI7503000985 15117317 c2 243 



LengthLength - 
1359 | [5131 |pgT]| 727 | || 2 . 4e-102 



Description 

sp : [LN : TAGF_BACSU] [AC:P13485] [GN : TAGF : RODC : TAG3 ] [OR:BACILLUS SUBTILIS] [DE:TEICHOIC 
ACID BIOSYNTHESIS PROTEIN F] [SP:P13485] [DB : swissprot] >pir : [LN : S0604 9] 
[AC:S06049 :G69720 ] [PN:probable CDPglycerol glycerophosphotransf erase, 
:CDP-glycerol :polyglycerol phosphate glycero-phosphotransf erase tagFrrodC 
protein: teichoic-acid synthase] [GN : rodC : tag- 3 : tagF ] [OR:Bacillus subtilis] 
[EC:2. 7.8.12] [DB:pir2] [MP:310 degrees ] >gp : [GI :g40100] [LN: BSRODC] [AC:X15200] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis rodC operon.] [NT:rodC 
(tag3) polypeptide (AA 1-746)] [SP:P13485] [LE:2178] [RE:4418] [DI:direct] 
>gp: [GI:ell84478:g2636098] [LN : BSUB0019] [AC : Z99122 : AL009126] 

[PN:CDP-glycerol :polyglycerol phosphate] [GN:tagF] [FN:teichoic acid biosynthesis] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodC] [SP:P13485] 
[LE:78129] [RE:80369] [DI : complement] >gp : [GI :ell84478:g2 636098] [LN:BSUB0019] 
[AC:Z99122 :AL009126] [PN : CDP-glycerol :polyglycerol phosphate] [GN: tagF] [FN: teichoic 
acid biosynthesis] [OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis complete 
genome (section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodC] 
[SP:P13485] [LE : 78129] [RE: 80369] [DI : complement] 



NT AA 

ORF Name NTID AAID — , — , , score probability 
LengthLength 



a75030009S6_I5I365S2_c2_253 ] [1360 | [5132 ||T5Tyu~|| 499 1 |1636 || 3 . 2e-168 



Description 

gp: [GI:dl039113:g4514332] [LN: AB013369] [AC : AB013369] [OR:Bacillus halodurans] 
[SR: Bacillus halodurans (strain : C- 125 ) DNA] [DB : genpept-bctl] [DE: Bacillus halodurans 

C-125 yesT and comEC genes, partial andcomplete cds . ] [NT:unknown] [LE:4328] [RE:5830] 
[DI: direct] 



NT AA 

ORF Name NTID AAID — . w — L score probability 
LengthLength c 



A17b030009a6_i582913S_c2_237 | [1361 | [ 5133 |pu^| |67 | [Tu3~~j| l . Oe-05 



Description 

pir: [LN:E69764] [AC:E69764 ] [PN : hypothetical protein ycnl] [GNrycnl ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI :ell82361:g2 632695] [LN : BSUB0003] [AC : Z99106 : AL009126] 
[GN:ycnI] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 3 of 21): from 402751 to611850.] [LE:43004] [RE:43618] 
[DI complement] >gp: [GI :dl009660 :gl805463] [LN:D50453] [AC:D50453] [GN:ycnI] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168 trpC2) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis DNA for 25-36 degree region containing theamyE-srfA region, 
complete cds.] [LE: 125407] [RE: 126021] [D I : complement] 



NT AA 

ORF Name NTID AAID — — , score probability 
LengthLength -L 

AI7503000986_i59377_c3_3i7 | [1352 | [5134 | | 171 | | 5S | 

Description 

NO-HIT 



348 



ORF Name 



NT 



AA 



^ MID Le - thLe - th £Core probability 



1AI7503000386 17047b7b r3 150 



Description 
NO-HIT 



ORF Name 



NT ID 



AAID 



NT 



AA 



Length Length 



score 



|AI7503000986_174218_ciJ>09 



probability 
l.ie-124 



Description 

Sp : [LN : YHCA_BACSU] [AC:P54585] [GNiYHCA] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 58.3 
KD PROTEIN IN GLPD-CSPB INTERGENIC REGION] [SP:P54585] [DB : swissprot] >pir : [LN : E6 9821] 
[AC:E69821 ] [PN : multidrug resistance protein homolog yhcA] [GNryhcA ] 
[CL: lincomycin-resistance protein lmrB] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell82890:g2633224] [LN : BSUB0005] [AC : Z99108 : AL009126] [GN : yhcA] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
5 of 21): from 802821 tol011250.] [NT:similar to multidrug resistance protein] 
[SP:P54585] [LE:174464] [RE:176062] [DI: direct] 



ORF Name 



LAI7503 



000985 1796S3 c3 325 



NTID 



NT 

AAID _ — 



AA 



Length Length 



[5137 



score probability 
1100 110.015 



Description 

sp: [LN:TCR2_BACSU] [AC:P14512] [GN : TET] [OR : BACILLUS SUBTILIS] [DE : TETRACYCLINE 
RESISTANCE PROTEIN] [SP:P14512] [DB : swissprot] >pir: [LN:S42238] [AC:S42238 ] 
[PN: tetracyclin resistance protein] [GN:tet ] [CL : tetracycline resistance protein] 
[OR staphylococcus aureus] [DB:pir2] >gp : [GI :g476735] [LN:PNS1CG] [AC:M16217] [GN:tet] 
[OR:Plasmid pNSl] [SR:Plasmid pNSl from Staphylococcus aureus, plasmid pTP5 DNA] 
[DB :genpept-bctl] [DE: Plasmid pNSl (from Staphylococcus aureus) 

encodingtetracycline-resistance (tet) , complete genome.] [LE:305] [RE:1684] [DI:direct] 

>gp: [GI:g456770] [LN:S67449] [AC:S67449] [PN : Tet (K) ] [GN:tet(K)] [OR : Staphylococcus 

aureus] [SR : Staphylococcus aureus pT181] [DB : genpept-bctl] [DE : tet (K) tetracycline 

efflux protein [Staphylococcus aureus, pT181 , Plasmid, 1380 nt] . ] [NT : tetracycline 

efflux protein; This sequence comes] [LE:1] [RE:1380] [DI:direct] >gp : [GI :gl052998] 

[LN:SAU38428] [AC:U38428] [PN : tetracycline resistance protein] [GN:tet] 

[OR staphylococcus aureus] [SR : Staphylococcus aureus plasmid pKH6] [DB : genpept -bctl ] 

[DE: Staphylococcus aureus tetracycline resistance plasmid pKH6 , complete sequence.] 

[LE:321] [RE:1700] [DI:direct] >gp : [GI : gl053140] [LN : SAU38656] [AC:U38656] 

[PN: tetracycline resistance protein] [GN:tet] [OR : Staphylococcus aureus] 

[SR: Staphylococcus aureus plasmid pKHl] [DB : genpept-bctl] [DE : Staphylococcus aureus 

tetracycline resistance plasmid pKHl, tetgene, complete cds . ] [LE:436] [RE: 1815] 

[DI: direct] 



ORF Name 



NTID 



A17S03000985_19548755J:3_132 

Description 

NO-HIT 



„ TT ^ NT AA 

AAID — — _ score 

Length Length 



probability 



ORF Name 



NT 



AA 



^ AAID Le - thLe - th 3cgre probability 



LAI7503000986 1970178 r3 145 



Description 
NO-HIT 



] F g7 I | 5135 |prg~||72 | 



349 



ORF Name 



NTID 



NT AA 
AAID — • _. — score 
LengthLength 



probability 



|A17503000986J.97429(l>2_cl_221 



] [1358 | [5140 ||102^"| |341 |pi~| 



l.le-39 



Description 

gp: [GI:el486019:g4995689] [LN : LLAO 11653] [AC : AJ011653 ] [PN:aldose 1-epimerase] 
[GN:galM] [FNimutarotase] [OR : Lactococcus lactis] [DB :genpept-bctl] [EC:5.1.3.3] 
[DE: Lactococcus lactis (strain MG13 63) galactose operon (galAMKTEgenes) . ] [LE:1990] 
[RE:3009] [DI:direct] 



ORF Name 



^7503000985 199S0885 ci 228 



Description 
NO-HIT 



ORF Name 



IAI7503000986 20579752 c3 319 



Description 
NO-HIT 

ORF Name 



U£7B030009fl6 20580443 c3 310 



NTID 



^„ NT AA 

AAID — — , , score 

LengthLength 



probability 



5T3T" 



IEID 



FT" 



NT AA 
NTID AAID — . — score 
LengthLength 



probability 



"j [1370 | [5142 1|49 | 



NTID 



NT AA 
AAID — — , score 
LengthLength 



probability 



] | 1371 1 j 5143 IP 3- !! 110 | p3~~ | p.5e-13 



Description 

gp: [GI:g3582220] [LN : AE001272] [AC : AE001272] [PN: conserved hypothetical protein] 
[GN:ORF00047] [OR : Lactococcus lactis] [DB :genpept-bct2] [DE : Lactococcus lactis DPC3147 
plasmid pMRCOl, complete plasmidsequence . ] [NTisimilar to GB:X69895 SP:P39044 PID:40067 
percent] [LE: 44384] [RE: 44728] [DI: direct] 



ORF Name 



IAI7503000986 20718790 t3 138 



Description 
NO-HIT 



NTID 



TY7T 



„ 7VTT . NT AA 

AAID — . T — , , score 

Length Length 



probability 



5144 11150 



ORF Name 



E 



17503000986 20789507 c2 280 



Description 
NO-HIT 



NTID 



][ 



TT7T 



„ Trfc NT AA 

AAID — — score 

LengthLength 

[5145 ||T35 | |51 I 



probability 



ORF Name 



NT AA 
NTID AAID , — _ — , score 
LengthLength 



1^17503000986 20897125 ci 212 



[1374 | [5146 



TUT 



probability 
2.9e-48 



Description 

pir: [LN:H69806] [AC:H69806 ] [PN:divalent cation transport protein homolog yf jQ] 
[GN:yfjQ ] [CL:magnesium and cobalt transport protein] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell82790:g2633124] [LN: BSUB0005] [AC : Z99108 : AL009126] [GN:yfjQ] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
5 of 21): from 802821 tol011250.] [NTrsimilar to divalent cation transport protein] 
[LE:68033] [RE:68992] [DI : complement] >gp : [GI : dl025214 :g2780401] [LN:D78509] 
[AC:D78509] [PN: Yf jQ] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain :AC327 ) DNA] 
[DB: genpept-bctl] [DE:Bacillus subtilis YfjG-YfjR genes, complete cds . ] [LE: 17333] 
[RE : 18292 ] [DI : direct] 



350 



ORF Name 



NT AA 

^ Mil) Le - thLe ~ th score probability 



| AlVi>0300098fc el 190 



Description 
NO-HIT 



ORF Name 



NT AA 

NTID AAID Le ~ t frLeng th score probability 



117503000986 22074200 t3 184 



TTTT 



Description 
NO-HIT 



[5148 



rrzz i 



ORF Name 



NTID AAID 



NT 



AA 



T — , » _ — score probabil ity 
Length Length c -L 



LA17503000986 2227721b 13 142 



] [1377 | [5149 |[TT5¥]|387 | pre - 1|1 . 5e-05 



Description 

pir: [LN:C44863] [AC:C44863 ] [PN:R45 antigen] [OR : Plasmodium falciparum] [DB:pir3] 



ORF Name 



NT AA 
NTID AAID — _ — , - score 
LengthLength 



probability 



IAI7503000986 22455213 13 i£9 



] [1378 | [ 5150 | p7— 1| 228 ||3 Ti |p.le-38 " 



Description 

gp: [GI:g4982229] [LN: AE001807] [AC : AE001807 : AE000512] [PN:response regulator DrrA] 
[GN:TM1655] [OR : Thermotoga maritima] [DB : genpept-bct2] [DE : Thermo toga maritima section 
119 of 136 of the complete genome.] [NT: similar to PID: 1575577 GB:AE000512 percent] 
[LE : 12477] [RE : 13220] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ T — , , score proba bility 
LengthLength c 



IAI7503000985 22857942 cl 224 



1379 



[5151 |[7n~]|236 ||417 ||4 . 8 



e-39 



Description 

gp: [GI:g4262236] [LN : ATAC0062 00] [AC : AC0062 00] [PNrputative ribose 5-phosphate 
isomerase] [GN: F10A8 . 17] [OR : Arabidopsis thaliana] [SRrthale cress] [DB : genpept -pln2 ] 
[DE:Arabidopsis thaliana chromosome II BAC F10A8 genomic sequence, complete sequence.] 
[LE:58788] [RE:59585] [DI: compl ement ] 



NT AA 

ORF Name NTID AAID — _„ — , _ score probabil ity 
LengthLength *- 



|AI7503000986_22859687_ci_186 | [1380 | [5152 |p^~] |145 ||143 ||5.2e-10 



Description 

pir: [LN:A64946] [AC:A64946 ] [PN : hypothetical protein bl841] [CL:copper resistance 
protein pcoC] [OR : Escherichia coli] [DB:pir2] >gp : [GI :gl788146] [LN: AE000278] 
[AC:AE000278 :U00096] [PN:orf, hypothetical protein] [GN:bl841] [FN:orf; Unknown] 
[OR Escherichia coli] [DB : genpept -bct2] [DE : Escherichia coli K-12 MG1655 section 168 of 
400 of the completegenome. ] [NT:fl24; This 124 aa ORF is 39 pet identical (6 gaps)] 
[LE:4445] [RE: 4819] [DI : complement ] 



ORF Name 


NT AA 

NTID AAID L — — th score probability 


|AI7503000986_23bb6 552_c2_240 


1 1381 M5153 11405 11134 11123 ||6.9e-08 1 


Description 



sp: [LN:YCX1_P0RPU] [AC:P51192] [OR : PORPHYRA PURPUREA] [DE : HYPOTHETICAL 20.1 KD PROTEIN 
IN YCF37-PSAF INTERGENIC REGION (ORF174)] [SP:P51192] [DB : swissprot] >pir : [LN: S73113] 
[AC:S73113 ] [PN : hypothetical protein 174] [OR : chloroplast Porphyra purpurea] [DB:pir2] 



351 



ORF Name 



E 



17503000986 23601557 tl 10 



Description 
NO-HIT 



NTIE) AAID Le g thLe g th ^ore probability 



[1382 | [5154 ||201~[|66 | 



ORF Name 



117503000986 23601577 t3 144 



Description 
NO-HIT 



ORF Name 



IAI7503000986 23611437 cl 188 



Description 
NO-HIT 



NT AA 

222 MID Le - thLe - th score probability 



] [1383 | [5155 ||Tgg~~| [61 | 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



] [1384 | [5156 |[TI5— | p 



ORF Name 



NT 



AA 



NTID AAID — _„ — _ score probability 
LengthLength *■ 



IA17503000986 23839193 cl 193 



1385 | [5157 |pT¥]j537 [ |1324 ||3 . 7e-l"31T 



Description 

pir: [LN:F69649] [AC:F69649 ] [PN : L- lactate permease IctP] [GN:lctP ] [CL:L- lactate 
permease] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82258 :g2632592] [LN: BSUB0002] 
[AC:Z99105 :AL009126] [PN : L- lactate permease] [GNilctP] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 2 of 21) : from 194651 
to415810.] [NTralternate gene name: ycgC] [LE:135677] [RE:137302] [DI:direct] 



ORF Name 



IAI7503000986 23876887 tl 60 



Description 
NO-HIT 



NTID 



7v7vtt ^ NT AA 

AAID — , « _ — , , score 

LengthLength 



probability 



|5158 ||I?3 | 



ORF Name 



E 



17503000986 23923412 tl 31 



Description 
NO-HIT 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



|1387 | [5159 |p5~||74 [ 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



1AI7503000986 23929627 t3 160 



[5160 ||177~||58 |p" 



probability 
10.046 



Description 

pir: [LN:I40601] [AC : 140601 : S70950 ] [PN: mobilization protein] [GN:mobA ] 
[OR:Bacteroides vulgatus] [DB:pir2] >gp : [GI :gl079659] [LN : BVU38243] [AC :U38243 :M72418] 
[PN:mobilization protein] [GN:mobA] [FN:conjugal transfer of Tn4555] [OR : Bacteroides 
vulgatus] [DB : genpept-bctl] [DE : Bacteroides vulgatus beta- lactamase (cfxA) gene, 
complete cds andmobilization protein (mobA) gene, complete cds . ] [LE:1208] [RE: 2611] 
[DI : complement] 



ORF Name 



NTID AAID T — _ „ — , score 
LengthLength 



probability 



^17503000986 24250317 cl 203 



] [1389 | |5161 ||T55 | pT" 



Description 
NO-HIT 



352 



ORF Name 



NT 



AA 



AI7bO300098fe_^4^fe7i>6V_t3_163 



^ MID Le - thLe - th score probability 
| 1J90 | [ 5152 || 545 1 1214 |p || 0.004fe 



] 



Description 

pir: [LN:C70649] [AC:C70649 ] [PN : hypothetical protein Rv3058c] [GN:Rv3058c ] 
[OR: Mycobacterium tuberculosis] [DB:pir2] >gp : [GI : e290931 : gl781155] [LN:MTCY22D7] 
[AC:Z83866:AL123456] [PN : hypothetical protein Rv3058c] [GN:Rv3058c] [OR : Mycobacterium 
tuberculosis] [DB :genpept-bctl] [DE : Mycobacterium tuberculosis H37Rv complete genome; 
segment 133/162.] [NT:Rv3058c, (MTCY22D7 . 23 ) , len: 216. Function:] [LE:4546] [RE:5196] 
[DI : complement] 



ORF Name 








NT ID 


NT AA 
AAID , — . , „ — . , score 
LengthLength 


===— ™==— 
probability 




AI750300098S_ 


_24394175_ 




_151 


| |139i 


| |5163 ||159 | 52 | 






Description 
















NO-HIT 
















ORF Name 








NT ID 


_ _ NT AA 
AAID , — . , , — . , score 
LengLhLength 


probability 




AI7503000986_ 


_24407827_ 




_2SS 


| |1392 


| |5164 | 561 | |186 | 






Description 
















NO-HIT 
















ORF Name 








NT ID 


TVTvxr, NT AA 
AAID . — . , _ — . , score 
LengthLength 


probability 




|AI7503000986_ 


_24412826_ 


_ c5 - 




1 1393 


| 963 320 286 3 


.7e-25 


i 



Description 

sp : [LN : HUTG_KLEAE] [AC:P19452] [GN:HUTG] [OR : KLEBSIELLA AEROGENES] [EC:3.5.3.8] 
[DE: (HISTIDINE UTILIZATION PROTEIN G) (FRAGMENT)] [SP:P19452] [DB : swissprot ] 
>gp: [GI:gl49204] [LN : KPNHUTC] [AC:M34604] [OR : Klebsiella aerogenes] [SR : Klebsiella 
aerogenes (strain W70) DNA] [DB :genpept-bctl] [DE :K. aerogenes histidine utilization 
repressor C (hutC) gene, completecds . ] [NT: histidine utilization repressor G] [LE:<1] 
[RE: 669] [DI: direct] 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000986 24415923 cl 211 



[1394 | [5156 |pP5~||300 ||282 ||9.7e-2T 



Description 

gp: [GI:gl209223] [LN: ACCEST] [AC:L38252] [PN : esterase] [GN:est] [OR : Acinetobacter 
lwoffii] [DB:genpept-bctl] [DE : Acinetobacter lwoffii orfl and esterase (est) genes, 
complete cds . ] [LE:638] [RE:1549] [DI:direct] 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



pi57 |p^~| prr 



score probability 
7.3e-32 



FT5~ 



J 



|AI7 5 03 000 986_24 6 62 82 5_C2_2 7 9 
Description 

pir: [LN:H70068] [AC:H70068 ] [PN : hypothetical protein ywrF] [GN:ywrF ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : el 184514 :g2 636133] [LN:BSUB0019] [AC : Z99122 : AL009126] 
[GN:ywrF] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 19 of 21): from 3597091to 3809700.] [LE:119966] [RE:120583] 
[DI:direct] >gp : [GI : e311284 :gl929333] [LN : BSZ93767] [AC:Z93767] [GN: ywrF] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE : B . subtilis DNA; 15.2 kb fragment, from ywqN gene to 
ywrO gene.] [LE:4588] [RE:5205] [D I : complement] >gp : [GI : ell84514 : g2636133] 
[LN:BSUB0019] [AC : Z99122 : AL009126] [GN: ywrF] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept] [DE:Bacillus subtilis complete genome (section 19 of 21) : from 3597091to 
3809700.] [LE:119966] [RE:120583] [DI:direct] 



353 



ORF Name 



NT ID AAID 



NT 



AA 



LengthLength 



score probability 



^17503000986 2470010 cl 191 



"j [1396 | [5158 |pi~||240 |p7— ||1.3e-08 



Description 

gp: [GI:el407791:g4493935] [LN : PFMAL3P5] [AC : AL034556] [GN : MAL3P5 . 8] [OR : Plasmodium 
falciparum] [SRrmalaria parasite P. falciparum] [DB :genpept- invl] [DE : Plasmodium 
falciparum MAL3P5, complete sequence.] [NT : predicted using hexExon; MAL3P5.8 
(PFC0610c),] [LE:29992] [RE:33537] [DI : complement] 



ORF Name 



NT 



AA 



|AI7B03000986_24a033fl6_ci_20i 



NTID AAID , score probability 

|i.4e-15 



LengthLength : 



1397 [ [5169 ||i080||359 ||ZTg 



Description 

pir: [LN:G70728] [AC:G70728 ] [PN : hypothetical protein Rv2563] [GN:Rv2563 ] 
[OR: Mycobacterium tuberculosis] [DB:pir2] >gp : [GI : el299946 : g3261618] [LN:MTCY9C4] 
[AC:Z77250 :AL123456] [PN : hypothetical protein Rv2563] [GN:Rv2563] [OR : Mycobacterium 

tuberculosis] [DB :genpept-bctl] [DE : Mycobacterium tuberculosis H37Rv complete genome; 

segment 113/162.] [NT:Rv2563, (MTCY9C4 . 05c) , len: 349. Unknown membrane] [LE:5757] 
[RE: 6806] [DI: direct] 



ORF Name 



[AI 7 503000985 2540907 c3 304 



Description 
NO-HIT 



NTID AAID 



NT 



AA 



TT9TT 



LengthLength 



score 



probability 



|5i7o j|rgs i pr~ 



ORF Name 



NTID 



NT 



AA 



score 



1 7 503000986 2bb27188 cl 225 



AAID 

LengthLength 

[5171 ||TTTr||378 ||830 | |i 



probability 
3T3e^ 



Description 

pir: [LN:E69640] [AC:E69640 ] [PN : hippurate hydrolase hipO] [GN:hipO ] [CL : hippurate 
hydrolase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84178 :g2635394] [LN : BSUB0015] 
[AC:Z99118 :AL009126] [PN : hippurate hydrolase] [GN:hipO] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC : 3 . 5 . 1 . 32] [DE: Bacillus subtilis complete genome (section 15 of 
21): from 2795131to 3013540.] [LE:203660] [RE:204910] [DI : complement] 

>gp: [GI:ell85802:g2635413] [LN: BSUB0016] [AC: Z99119 : AL009126] [PN : hippurate hydrolase] 
[GN:hipO] [OR: Bacillus subtilis] [DB :genpept-bctl] [EC : 3 . 5 . 1 . 32] [DE: Bacillus subtilis 
complete genome (section 16 of 21): from 2997771to 3213410.] [LE:1020] [RE:2270] 
[DI: complement] >gp : [GI :g2293256] [LN: AF008220] [AC : AF008220] [PN:putative hippurate 
hydrolase] [GN:hipO] [OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis 
rrnB-dnaB genomic region.] [LE:178157] [RE:179407] [DI:direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



IA17503000986 25551^40 cl 214 



[3~4TRT 



|5i72 jjggT — | peg \pr? 



score probability 
10.0023 



Description 

gp: [GI:el331922 :g3758855] [LN : PFMAL3P6] [AC:Z98551] [GN :MAL3P6 . 11] [OR : Plasmodium 
falciparum] [SR:malaria parasite P. falciparum] [DB : genpept- invl] [DE : Plasmodium 
falciparum MAL3P6, complete sequence.] [NT : predicted using hexExon; MAL3P6.11 
(PFC0760c),] [LE: 53772] [RE: 63956] [DI : complement] 



354 



ORF Name 

1A17503000986 25bV9390 tl 50 



NTID 



NT AA 
AAID — « — score 
LengthLength 



probability 
j [1401 | [ 5173 |[TT5¥"|p97 |[ 402 || 8.Se-37 



Description 

gp: [GI:el294490:g3169038] [LN:SC1C3] [AC : AL023702] [PN:putative transferase] 
[GN:SC1C3 . 12] [OR : Streptomyces coelicolor] [DB :genpept-bctl] [DE : Strep tomyces 
coelicolor cosmid 1C3 . ] [NT : SC1C3 . 12 , possible transferase, len: 697 aa; weakly] 
[LE:13186] [RE:15279] [DI: direct] 



ORF Name 



NTID AAID 



[)M7503 



000986 25910952 c3 309 



NT AA 
LengthLength 
[5174 |[T¥T | |4fi | 



score probability 



Description 
NO-HIT 



ORF Name 



LA17503000986 26753588 tl 44 



NTID 



TTuT" 



NT 



AA 



AAID . — , . 



LengthLength 



score 



|5175 |[Q5X] [460 



probability 
2.6e-31 



Description 

pir: [LN:H69762] [AC:H69762 ] [PN : two -component sensor histidine kinase homolog yclK] 
[GN:yclK ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell82343 : g2632677] [LN: BSUB0003] 
[AC:Z99106:AL009126] [GN : yclK] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 3 of 21): from 402751 to611850.] 
[NT:similar to two-component sensor histidine kinase] [LE:24077] [RE:25498] [DI:direct] 
>gp: [GI:dl009643:gl805446] [LN:D50453] [AC:D50453] [PN:homologue of alkaline 
phosphatase synthesis] [GN:yclK] [ORrBacillus subtilis] [SR:Bacillus subtilis 
(strain:168 trpC2) DNA] [DB : genpept-bctl] [DErBacillus subtilis DNA for 25-36 degree 
region containing theamyE-srfA region, complete cds . ] [LE: 106480] [RE: 107901] 
[DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



— _ T — _ score probability 
LengthLength c 



1A?7503000986 272550 c2 268 



Description 
NO-HIT 



"J [1404 | [5176 ||123 1 140 | 



ORF Name 



NT AA 

NTID AAID — « — , , score probability 
LengthLength c *- 



IAI7503000986 292883 c2 229 



Description 
NO-HIT 

ORF Name 



] [1405 | [5177 |p-g— ]| 4i | 



NTID AAID 



NT 



AA 



„ — ^, T — , , score probability 
LengthLength c 



LAI7503000986 29565627 c3 286 



[5178 [[573 [ |292 | 



Description 
NO -HIT 

ORF Name 



NTID 



NT AA 
AAID — — , , score 
LengthLength 



probability 



IAI7503000986 30476575 ci 216 



Description 
NO-HIT 



1407 | [5179 | prr^]| 54 | 



355 



ORF Name 



NTID 



NT 

AAID 



AA 



Length Length 



score probability 



U.7503000986 30703458 c2 230 



[1408 | [5180 [ [SOS | |301 | pTT]| 2 . 3e-126 



Description 

Sp : [LN : PTSB_STAXY] [AC:P51184] [GN : SCRA] [OR : STAPHYLOCOCCUS XYLOSUS] [EC : 2 . 7 . 1 . 69] 
[DE : (EC 2.7.1.69) (EII-SCR) ] [SP:P51184] [DB : swissprot] >pir : [LN : S3 9978] [AC:S39978 ] 
[PN:scrA protein] [CL : phosphotransferase system sucrose-specific enzyme II, factor II] 
[OR: Staphylococcus xylosus] [DB:pir2] >gp : [GI :g407908] [LN:SXSCRA] [AC:X69800] 
[PN:EIIscr] [GNrscrA] [OR : Staphylococcus xylosus] [DB : genpept-bctl] [DE : S . xylosus scrA 
gene and unidentified open reading frames.] [NT : ORF2] [SP:P51184] [LE:1053] [RE:2495] 
[DI : direct] 



ORF Name 



NTID AAID „ — 



7503000985 34032561 c3 316 



NT AA 
LengthLength 



score probability 



TO" 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score probability 



iI7503000986 34199202 cl 189 



] [1410 | [5182 |pu~||179 |pT3~[|3.3e-31 



Description 

gp: [GI:ell84294:g2635712] [LN : BSUB0017] [AC : Z99120 : AL009126] [PN : transcript ional 
regulator] [GN :paiA] [FN:negative regulation of sporulation, septation] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 17 of 21) : 
from 3197001to 3414420.] [LE:106800] [RE:107318] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — __ — _ score 
LengthLength 



probability 



IAI7503000986 34610667 c3 297 



] | 1411 | |pu~|p29 |pB~[|3.9e-44 



Description 

pir: [LN:B69377] [AC:B69377 ] [PN:ABC transporter, ATP-binding protein homolog] 
[CL : unas signed ATP-binding cassette proteins: ATP-binding cassette homology] 
[OR:Archaeoglobus fulgidus] [DB:pir2] >gp : [GI : g2649576] [LN : AE001033] 
[AC:AE001033 :AE000782] [PN:ABC transporter, ATP-binding protein] [GN:AF1018] 
[OR:Archaeoglobus fulgidus] [DB : g enpep t -bet 2 ] [DE : Archaeoglobus fulgidus section 74 of 
172 of the complete genome.] [NT:similar to GB:L77117 SP:Q58206 PID:1591493 percent] 
[LE : 2126] [RE : 2812] [DI : complement] 



ORF Name 



_ _ j- . _ _ ^TT 

NTID AAID — — , „ score 

LengthLength 



probability 



|AI7503000986_35955213_c3_302 

Description 

NO-HIT 



"j [1412 | [5184 



1165 I 



356 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



[A17503000986 3bl40%3 12 120 



~j [1413 | [5185 |[T¥3T1|476 ~~||226 ||1.9e-15 



Description 

sp: [LN:TAGF_BACSU] [AC:P13485] [GN : TAGF : RODC : TAG 3 ] [OR:BACILLUS SUBTILIS] [DE : TEICHOIC 
ACID BIOSYNTHESIS PROTEIN F] [SP:P13485] [DB : swissprot] >pir : [LN : S06 04 9] 
[AC:S06049:G69720 ] [PN:probable CDPglycerol glycerophosphotransf erase , 
:CDP-glycerol :polyglycerol phosphate glycero-phosphotransf erase tagFrrodC 
protein: teichoic-acid synthase] [GN : rodC : tag- 3 : tagF ] [OR:Bacillus subtilis] 
[EC:2.7.8.12] [DB:pir2] [MP : 310 degrees ] >gp : [GI :g40100] [LN : BSRODC] [AC:X15200] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis rodC operon.] [NTrrodC 
(tag3) polypeptide (AA 1-746)] [SP:P13485] [LE:2178] [RE:4418] [DI: direct] 
>gp: [GI:ell84478 :g2636098] [LN : BSUB0019] [AC : Z99122 : AL009126 ] 

[PN:CDP-glycerol rpolyglycerol phosphate] [GN:tagF] [FN:teichoic acid biosynthesis] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodC] [SP:P13485] 
[LE:78129] [RE:80369] [DI : complement ] >gp: [GI :ell84478:g2 636098] [LN : BSUB0019] 
[AC:Z99122 :AL009126] [PN : CDP-glycerol : polyglycerol phosphate] [GN:tagF] [FN:teichoic 
acid biosynthesis] [OR: Bacillus subtilis] [DB:genpept] [DE: Bacillus subtilis complete 
genome (section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodC] 
[SP:P13485] [LE:78129] [RE:80369] [DI : complement] 



ORF Name 



E 



7503000986 3906385 t3 183 



Description 
NO-HIT 



NTID AAID 



NT 



AA 



LengthLength 



— score probability 



] [1414 | [ 6186 |pg— [ |4i | 



ORF Name 



|M7503000986_39575il_c2_248 



Description 
NO-HIT 



][ 



1415 



NT 



AA 



NTID AAID — _ T — , , score proba bility 
LengthLength ^ 



[5187 ||126 | pT 



ORF Name 



17503000986 4062562 c2 269 



Description 
NO-HIT 



NT AA 

NTID AAID — *^ — , , score probability 
LengthLength 



"] [1416 | [5188 |[TF3— |p | 



ORF Name 



IAI7503000986 4178140 c2 270 



Description 
NO-HIT 



NT AA 

NTID AAID — — , , score probabil ity 
LengthLength *- 



] | 1417 | |pn~||46 | 



ORF Name 



IA17503000986 4297627 12 119 



Description 
NO-HIT 



NT AA 

NTID AAID Le ~ thLe ~ t fr score probability 



1418 I 15190 11132 1 



ORF Name 



[A:H7503000986_4382062_ri_i9 

Description 

NO-HIT 



NT AA 

NTID AAID Le ^ thLe ~ th score probability 



1419 | [5191 |p"5~~[ |S2 



357 



NT AA 

ORF Name NTID AAID Le ^ thLe ^ t ] 1 scor e probability 



|A17S030009flfc_449b0<>2_c3_2B7 | [1420 | [ 5192 |[ 345 | |114 | pa || 1.2e-12 

Description 

pir: [LN:E69764] [AC:E69764 ] [PN : hypothetical protein ycnl] [GNiycnl ] [OR:Bacillus 

subtilis] [DB:pir2] >gp: [GI : ell82361 :g2632695] [LN: BSUB0003] [AC : Z99106 : AL009126] 
[GN.-ycnl] [FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 

complete genome (section 3 of 21): from 402751 to611850.] [LE:43004] [RE:43618] 
[DI: complement] >gp : [GI :dl0 09660 :gl8 0546 3] [LN:D50453] [AC:D50453] [GN:ycnI] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168 trpC2) DNA] [DB : genpept -bet 1 ] 
[DE: Bacillus subtilis DNA for 25-36 degree region containing theamyE-srfA region, 

complete cds . ] [LE: 125407] [RE: 126021] [D I : complement] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



A17B03000986_46flV893_cl_213 | [1421 | [5193 [[TuTT] |352 || 2.8e-b0 | 



Description 

gp: [GI:g2822338] [LN : AF016485] [AC : AF016485] [OR : Halobacterium sp . NRC-1] 
[DB : genpept -bet 2] [DE : Halobacterium sp. NRC-1 plasmid pNRClOO, complete plasmid 
sequence.] [NT : ORF H0660; similar to ORF in Sulfolobus] [LE:60258] [RE:61397] 
[DI : complement] >gp : [GI : g2822427] [LN : AF016485] [AC : AF016485] [OR : Halobacterium sp. 
NRC-1] [DB:genpept-bct2] [DE : Halobacterium sp . NRC-1 plasmid pNRClOO, complete plasmid 
sequence.] [NT : ORF H1696; similar to Sulolobus solf ataricus] [LE: 161992] [RE: 163131] 
[DI :direct] 

NT AA 

ORF Name NTID AAID — _ — , , score prob ability 
LengthLength L - 

[AI7503000986_4689007_ri_49 | [ 1422 | [ 5194 |p^r"] |40 | 

Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ T — , , score probab ility 
LengthLength J - 

[&I750 30 0 0986_4727217_tl_25 | [1423 | [5195 || 402 1 1 133 | 

Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — _ — , score probability 
Length Length 

AI7gQ3000986_4797i25_t3_14a | [1424 | [5195 |pTT| | 548 || 970 || 1.2e-97 



Description 

sp: [LN:ARAB_BACSU] [AC:P94524] [GN : ARAB] [OR:BACILLUS SUBTILIS] [EC : 2 . 7 . 1 . 16] 
[DE : L-RIBULOKINASE , ] [SP:P94524] [DB : swissprot ] >pir : [LN : D69587] [AC:D69587 ] 
[PN:L-ribulokinase araB] [GN:araB ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell84128:g2635344] [LN : BSUB0015] [AC : Z99118 : AL009126] [PN : L-ribulokinase] 
[GN:araB] [FN: L-arabinose utilization] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[EC:2.7.1.16] [DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 
3013540.] [SP:P94524] [LE:149661] [RE:151343] [DI : complement] 

>gp: [GI:ell65307:gl770015] [LN : BSZ75208] [AC:Z75208] [PN : L-ribulokinase] [GN:araB] 
[OR:Bacillus subtilis] [DB : genpept -bctl] [DE : B . subtilis genomic sequence 89009bp.] 
[NT: homology to araB of Escherichia coli ; identified on] [SP:P94524] [LE : 18564] 
[RE : 2 0246] [DI : direct] 



358 



ORF Name 



NT AA 
NT ID AAID — ^ T — , . score 
LenqthLength 



probability 



|Al7b03000986 48^1588 c2 258 



J [1425 | [5197 j| 46B 1 1154 || 134 || 4.7e-09 



Description 

pir: [LNrA69783] [AC:A69783 ] [PN: transcription regulator MarR family homolog ydgG] 
[GN:ydgG ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : dl020150 :gl881370] [LN : AB001488] 
[AC:AB001488] [GN:ydgG] [ORrBacillus subtilis] [SRrBacillus subtilis (strain: 168) DNA] 
[DBrgenpept-bctl] [DE: Bacillus subtilis genome sequence, 148 kb sequence of the 

regionbetween 35 and 47 degree.] [NT : FUNCTION UNKNOWN . ] [LE:142152] [RE:142610] 
[DI: direct] >gp : [GI:ell82530:g2 632864] [LN : BSUB0003] [AC : Z 99106 : AL009126] [GNrydgG] 
[FN:unknown] [ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis complete 

genome (section 3 of 21) r from 402751 to611850.] [NTrsimilar to transcriptional 

regulator (MarR family)] [LEr205898] [RE:206356] [DIrdirect] >gp r [GI : ell82543 rg2632877] 
[LN:BSUB0004] [AC : Z99107 r AL009126] [GNrydgG] [FNrunknown] [ORrBacillus subtilis] 
[DBrgenpept-bctl] [DErBacillus subtilis complete genome (section 4 of 21) r from 600701 

to813890.] [NTrsimilar to transcriptional regulator (MarR family)] [LEr7948] [RE: 8406] 
[DIrdirect] 



ORF Name 



E 



17503000985 5111253 tl 6 



Description 
NO-HIT 



NT ID AAID — — score probability 



NT AA 
Length Length ' 
[5198 |p£7~| [48 | 



ORF Name 



IAI7503000986 5128587 tl 12 



NT ID 



„„ NT AA 

AAID — . T — , , score 

LengthLength 



probability 



][ 



[5199 |pr^[|231 ||336 ||1.8e-30 



Description 

pir r [LN:A69811] [ACrA69811 ] [PN r conserved hypothetical protein yflK] [GNryflK ] 
[CLr hypothetical protein HI0278] [ORrBacillus subtilis] [DBrpir2] 

>gpr [GIrell82755:g2633089] [LN : BSUB0005] [AC : Z99108 : AL009126] [GN : yf IK] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete genome (section 
5 of 21) r from 802821 tol011250.] [NTrsimilar to hypothetical proteins] [LE: 34765] 
[RE:35430] [DI:direct] >gp : [GI : dl023167 : g2443233] [LNrD86417] [AC:D86417] [PNrYflK] 
[ORrBacillus subtilis] [SRrBacillus subtilis (strain rAC327) DNA] [DBrgenpept-bctl] 
[DErBacillus subtilis 35.7 kb genomic DNA, 70-73 degree region, complete cds . ] 
[LE : 13092] [RE : 13757] [DI : complement] 



ORF Name 



NTID 



„ T „ NT AA 

AAID — — , score 

LengthLength 



[AI7503 



000985 5283390 c2 259 



J 



[5200 |pTF-||47i |pr 



probability 
3.9e-44 



] 



Description 

gp: [GI:g3676414] [LN : AF051917] [AC : AF051917 : LI 9570] [PNrunknown] [OR : Staphylococcus 
aureus] [DB rgenpept-bct2] [DE r Staphylococcus aureus plasmid pSK41, complete sequence.] 
[NTrOrf423] [LEr755] [REr2026] [DIrdirect] 



ORF Name 



NTID 



7V7VTT , NT AA 

AAID — — . score 

LengthLength 



probability 



Al7i>03000986_6307Jb3_IB2 

Description 

NO-HIT 



TT2~9~ 



|520i |p73 [ [5T 



359 



ORF Name 



NT 



AA 



NTID AAID . 

LengthLength 



U7b03000966 6930462 c2 260 



J [1430 | [5202 ||l227|pre~ 



score probability 
634 l|4.9e-62 



Description 

pir: [LN:E69783] [AC:E69783 ] [PN : bicyclomycin resistance protein homolog ydgK] [GN:ydgK 
] [CL: bicyclomycin resistance protein] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl020154:gl881374] [LN: AB001488] [AC : AB001488] [GN:ydgK] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain: 168) DNA] [DB : genpept-bctl] [DErBacillus subtilis genome 
sequence, 148 kb sequence of the regionbetween 35 and 47 degree.] [NT : SIMILAR TO 
BICYCLOMYCIN RESISTANCE PROTEIN.] [LE: 146860] [RE: 148068] [DI: direct] 

>gp: [GI:ell82547:g2632881] [LN: BSUB0004] [AC : Z99107 : AL009126] [GN:ydgK] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
4 of 21): from 600701 to813890.] [NT:similar to bicyclomycin resistance protein] 
[LE:12656] [RE:13864] [DIrdirect] 



ORF Name 



U7503000986 7242812 t3 152 



NTID 



AAID 



NT AA 
LengthLength 
[5203 | |627 | [2TTS" 



score probability 
373 ||2.2e-34 



Description 

sp : [LN : 3MGH_BACSU] [AC:P94378] [GN : YXL J] [OR:BACILLUS SUBTILIS] [EC:3.2.2.-] 
[DE : PUTATIVE 3 -METHYLADENINE DNA GLYCOSYLASE, ] [SP:P94378] [DB : swissprot] 

>pir: [LN:D70082] [AC:D70082 ] [PN: DNA- 3 -methyl adenine glycosidase homolog yxlJ] 
[GN:yxlJ ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell86361 : g2636397] [LN : BSUB0020] 
[AC:Z99123 :AL009126] [GN:yxlJ] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DErBacillus subtilis complete genome (section 20 of 21): from 3798401to 4010550.] 
[NT:similar to DNA- 3 -methyl adenine glycosidase] [SP:P94378] [LE:164671] [RE:165261] 
[DI: complement] >gp: [GI :dl 012408 :gl 783264] [LN:D83026] [AC :D83026 :D45911] [GN:yxlJ] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain :BGSC 1A1) DNA] [DB : genpept-bctl] 
[DErBacillus subtilis genome sequence covering lic-cel region.] [NT r homologous to DNA 

glycosylases; hypothetical] [LE: 59204] [RE : 59794] [DIr direct] 



ORF Name 



NTID AAID . — 



^17503000986 803137 tl 5 



][ 



TOT- 



NT AA 
Length Length 
[5204 | [TS5 | [5T 



score probability 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID AAID , 



LengthLength 



score 



LA*7503000986 804837 il 59 



] [1433 | [5205 |pi~||32Q [[Im- 



probability 
■3e-3i 



Description 

gp: [GI:e327689:g2407930] [LNr LLPFLDB13] [AC : AJ000326] [GNrorfA] [FN:putative membrane 
protein] [OR r Lactococcus lactis] [DB : genpept-bctl] [DE : Lactococcus lactis pfl gene 
(strain DB1341) .] [LE:464] [RE:1381] [DI:direct] 



ORF Name 



^17503000986 814838 cl 208 



NTID 



NT 



AA 



AAID — 

LengthLength 



score 



pos |po | |219 |pr~ 



probability 
|7.ie-45 



Description 

pir: [LN:D69821] [AC:D69821 ] [PN r hypothetical protein yhbJ] [GNryhbJ ] [ORrBacillus 
subtilis] [DB:pir2] >gp: [GI rell82889rg2 633223] [LN r BSUB0005] [AC : Z99108 : AL009126] 

[GN:yhbJ] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 5 of 21): from 802821 tol011250.] [LE:173758] [RE:174423] 

[DI: direct] 



360 



ORF Name 



NT 

NT ID AAID _ 



AA 



Length Length 



score 



probability 



|A17503000986_8i7b7U_c3_32g 



|5207 [p^TT[|4io ~~||708 [|7 . Oe-70 



Description 

pir: [LN:D70179] [AC:D70179 ] [PN:Na+/H+ antiporter (nhaC-1) homolog] [OR:Borrelia 
burgdorferi] [SR:, Lyme disease spirochete] [DB:pir2] >gp : [GI :g2688567] [LN : AE001165] 

[AC:AE001165:AE000783] [PN:Na+/H+ antiporter (nhaC-1) ] [GN:BB0637] [OR:Borrelia 
burgdorferi] [SR:Lyme disease spirochete] [DB :genpept-bct2] [DE:Borrelia burgdorferi 

(section 51 of 70) of the complete genome.] [NTrsimilar to GB:M73530 SP:P27611 
PID:143245] [LE:6377] [RE:7726] [Dlidirect] 



ORF Name 



NT ID 



NT 



AA 



^£7503000985 8568 c3 289 



AAID , — . , score probability 

110.017 



LengthLength 1 



|143g | [5208 |pT~ ||142 |[7T 



Description 

gp: [GI :g4151243] [LN: AF063590] [AC : AF063590] [PNrmicrocin E492 immunity protein] 
[GNrmceB] [OR : Klebsiella pneumoniae] [DB : genpept -bct2 ] [DE : Klebsiella pneumoniae 
microcin E492 precursor (mceA) and microcinE492 immunity protein (mceB) genes, complete 
cds.] [NT: overlapping with mceA] [LE:541] [RE:828] [DI : complement] 



ORF Name 



NT ID 



„ TO NT AA 

AAID — — score 

LengthLength 



IAI7503000986 901515 c3 318 



] [1437 | [5209 |p^~||94 | [gT 



probability 
10.0019 



Description 

gp: [GI:g4731215] [LN: BMMITOCH03] [AC : AF110612] [PN : cytochrome b apoenzyme] [GN:Cytb] 
[OR: Mitochondrion Boophilus microplus] [SR:southern cattle tick] [DB : genpept -inv2] 
[DE:Boophilus microplus cytochrome b apoenzyme (Cytb) gene, partialcds; tRNA-Ser, 
tRNA-Leu, tRNA-Cys , and tRNA-Met genes, completesequence; and NADH dehydrogenase 
subunit 2 (ND2) gene, partial cds, mitochondrial genes for mitochondrial products.] 
[LE:<1] [RE:285] [DI:direct] >gp : [GI : g4 7312 15] [LN : BMMITOCH03 ] [AC : AF110612] 
[PN: cytochrome b apoenzyme] [GN:Cytb] [OR : Mitochondrion Boophilus microplus] 
[SR: southern cattle tick] [DB: genpept] [DE : Boophilus microplus cytochrome b apoenzyme 
(Cytb) gene, partialcds; tRNA-Ser, tRNA-Leu, tRNA-Cys, and tRNA-Met genes, 
completesequence; and NADH dehydrogenase subunit 2 (ND2) gene, partial 
cds, mitochondrial genes for mitochondrial products.] [LE:<1] [RE: 285] [DI: direct] 
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Description 
NO-HIT 
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AAID — — score 

LengthLength 
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Description 

sp: [LN:YC36_GUITH] [AC:O78501] [GN:YCF36] [OR : GUILLARDIA THETA] [SR : , CRYPTOMONAS PHI] 
[DE: HYPOTHETICAL 18.4 KD PROTEIN YCF3 6] [SP:O78501] [DB : swissprot] >gp : [GI : g3603 031] 
[LN:AF041468] [AC:AF041468 :X14171 :X62349 :X51511 :X14504 :X52158 :X52912 : X56806 : M7654 7] 
[PN: hypothetical chloroplast RF36] [GN:ycf36] [OR : Chloroplast Guillardia theta] 
[SR: Guillardia theta] [DB : genpept -pln2] [DE : Guillardia theta complete plastid genome.] 
[LE:88514] [RE:88981] [DI: direct] 
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NO-HIT 
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Description 
NO-HIT 
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probability 
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Description 

gp: [GI:dl042605:g5103194] [LN : AP000342 ] [AC : AP000342] [GNrydhA] [OR : Plasmid R100] 
[SR:Plasmid R100 (lab_host : Escherichia coli strain K-12) DNA] [DBigenpept] [DE : Plasmid 

R100 genomic DNA.] [NT: 58% identical (1 gap) to 376 residues of 404 aa] [LE: 37813] 
[RE: 39018] [DI : complement] 
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Description 
NO-HIT 
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probability 
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Description 
NO-HIT 
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Description 
NO-HIT 

ORF Name 



NT 



AA 



NTID AAID — , — 



LengthLength 



score 



[AI7503000987 10662763 c2 764 
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probability 
9 . 2e-68 



Description 

sp : [LN : YACL_BACSU] [AC:Q06754] [GN : YACL] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 40 . 9 
KD PROTEIN IN MECB-GLTX INTERGENIC REGION] [SP:Q06754] [DB : swissprot] >pir: [LN:S66118] 
[AC:S66118 :D69741 ] [PN : conserved hypothetical protein yacL] [GNryacL ] [CL : conserved 
hypothetical protein yacL] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : dl005865 :g467477] 
[LN:BAC180K] [AC:D26185] [PN:unknown] [ORrBacillus subtilis] [SR:Bacillus subtilis 
(sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 
kilobase region of replication origin.] [LE: 172273] [RE: 173373] [DI:direct] 
>gp: [GI :ell82022 :g2632356] [LN : BSUB0001] [AC : Z99104 : AL009126 ] [GN:yacL] [FN:unknown] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
1 of 21): from 1 to213080.] [NT:similar to hypothetical proteins] [SP:Q06754] 
[LE: 108671] [RE: 109771] [DI: direct] 
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Description 

sp: [LN:ER19_YEAST] [AC:P32377] [GN:ERG19 :MVD1 :MPD : YNR043W :N3427] [OR : SACCHAROMYCES 
CEREVISIAE] [SR: , BAKER 1 S YEAST] [EC : 4 . 1 . 1 . 33] [DE : PYROPHOSPHATE DECARBOXYLASE)] 

[SP:P32377] [DB : swissprot] >pir : [LN: S63374] [AC : S63374 : S2 0057 ] [PN : diphosphomevalonate 
decarboxylase, : protein N3427 : protein YNR043w] [GN : MVD1 : ERG1 9 : MPD ] [OR : Saccharomyces 
cerevisiae] [EC : 4 . 1 . 1 . 33] [DB:pir2] [MP:14R] >gp : [GI : e238625 : gl292890] [LN:SCERG19] 

[AC:X97557] [PN : diphosphomevalonate decarboxylase] [GN : ERG19] [OR : Saccharomyces 
cerevisiae] [SR:baker's yeast] [DB :genpept-plnl] [EC : 4 . 1 . 1 . 33] [DE : S . cerevisiae ERG19 
gene.] [SP:P32377] [LE:544] [RE:1734] [DIrdirect] >gp : [GI : e239591 :gl302550] 

[LN:SCYNR043W] [AC : Z71658 : Y1313 9] [GN:MVD1] [OR : Saccharomyces cerevisiae] [SR:baker's 
yeast] [DB:genpept-plnl] [DE : S . cerevisiae chromosome XIV reading frame ORF YNR043w.] 

[NT : ORF YNR043w] [SP:P32377] [LE:229] [RE: 1419] [DI:direct] 
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Description 

pir: [LN:E71854] [AC:E71854 ] [PN : hypothetical protein jhpl053] [GN: jhpl053 ] 
[OR Helicobacter pylori] [SR:strain J99, , strain J99] [SR:strain J99, ] [DB:pir2] 
>gp: [GI:g4155644] [LN : AE001533 ] [AC : AE001533 : AE00143 9] [PN : putative] [GN: jhpl053] 
[OR : Helicobacter pylori J99] [DB :genpept-bct2] [DE : Helicobacter pylori, strain J99 
section 94 of 132 of the completegenome . ] [NT: similar to H. pylori 26695 gene HP1124] 
[LE:5370] [RE: 6365] [DI : complement] 
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probability 
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Description 

pir: [LN:T03492] [AC:T03492 ] [PN : hypothetical protein] [OR : Rhodobacter capsulatus] 
[DB:pir2] [MP:1 ] >gp : [GI :g3128293] [LN : AF010496] [AC : AF010496] [PN : hypothetical 
protein] [OR: Rhodobacter capsulatus] [DB :genpept-bct2] [DE : Rhodobacter capsulatus 
strain SB1003, partial genome.] [LE: 54291] [RE: 55613] [DI : complement] 



ORF Name 
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Description 
NO-HIT 
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Description 

Sp: [LN:YXEI_BACSU] [AC:P54948] [GN : YXEI : LP 9 A] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
37.2 KD PROTEIN IN IDH-DEOR INTERGENIC REGION] [SP:P54948] [DB : swissprot] 
>pir: [LN:C70075] [AC:C70075 ] [PN : penicillin amidase homolog yxel] [GNryxel ] 
[CL: choloylglycine hydrolase] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell84679:g2636500] [LN: BSUB0021] [AC : Z99124 : AL009126] [GN:yxeI] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
21 of 21): from 3999281to 4214814.] [NT:similar to penicillin amidase] [SP:P54948] 
[LE:62053] [RE:63039] [DI : complement] >gp : [GI : dl008921 :gl408494] [LN:D45912] 
[AC:D45912] [GN:yxeI] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:BGSC 1A1 
(Marburg 168; trpC2) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis genome sequence 
between the iol and hut operon, partial and complete cds . ] [NT : homologous to penicillin 
acylase] [LE:8437] [RE:9423] [DI: direct] 
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Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT : ORF1] 

[LE : 1101] [RE : 1922 ] [DI : complement] 
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Description 

pir: [LN:B70148] [AC:B70148 ] [PN : ribosomal protein S12] [GN : BB0387 : rpsL ] 
[CL: Escherichia coli ribosomal protein S12] [OR:Borrelia burgdorferi] [SR:, Lyme 
disease spirochete] [DB:pir2] >gp : [GI : g2688295] [LN : AE001144] [AC : AE001144 : AE000783] 
[PN:ribosomal protein S12 (rpsL) ] [GN:BB0387] [OR:Borrelia burgdorferi] [SR:Lyme 
disease spirochete] [DB : genpept -bct2 ] [DE:Borrelia burgdorferi (section 30 of 70) of 
the complete genome.] [NT: similar to SP:P18662 percent identity: 79.03;] [LE:1809] 
[RE : 2183] [DI : complement] 
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Description 

pir: [LN:S72278] [AC : S72278 : S78483 ] [PN : ATP -dependent Clp proteinase, homolog] 
] [OR:plastid Plasmodium falciparum] [EC : 3 . 4 . 21 . -] [DB:pir2] 



[GN:clpC 
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Description 

pir : [LN:D69906] [AC:D69906 ] [PN : hypothetical protein yojG] [GNryojG ] [ORrBacillus 
subtilis] [DB:pir2] >gp: [GI : ell85418 :g2634339] [LN : BSUB0011] [AC : Z99114 : AL009126] 
[GN:yojG] [FNiunknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 11 of 21): from 2000171to 2207900 J [LE:120715] [RE:121143] 
[DI: complement] >gp : [GI : g3169323] [LN: AF026147] [AC : AF026147] [PN : YojG] [GN: yojG] 
[FNrunknown] [OR:Bacillus subtilis] [DB : genpept -bct2 ] [DErBacillus subtilis Yoj A 
(yojA) , YojB (yojB) , YojC (yojC) , YojD(yojD), Yoj E (yojE) , Yoj F (yojF) , YojG (yojG), 
YojH (yojH), Yoj I (yoj I) f YojJ (yoj J) , Yoj K (yojK) , YojL (yojL) , YojM (yojM) , 
YojN(yojN), and YojO (yojO) genes, complete cds; and OdhA (odhA) gene,partial cds.] 
[LE:4122] [RE:4550] [DIrdirect] 
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Description 

sp: [LN:THID_BACSU] [AC:P39610] [GN : THID : IPA- 52R] [OR:BACILLUS SUBTILIS] [EC:2.7.4.7] 
[DE: (HMP-P KINASE) ] [SP:P39610] [DB : swissprot] >pir : [LN : S39707] [AC : S3 9707: F6 9722 ] 
[PN:phosphomethylpyrimidine kinase thiD: protein ipa-52r] [GN:thiD ] 
[CL:phosphomethylpyrimidine phosphate kinase] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:g413976] [LN : BSGENR] [AC:X73124] [GN:ipa-52r] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] [SP:P39610] [LE:55788] 
[RE:56603] [DI : complement ] >gp: [GI :ell86301:g2 636337] [LN : BSUB0020] 
[AC : Z99123 : AL009126] [PN : phosphomethylpyrimidine kinase] [GN:thiD] [FN:thiamin 
biosynthesis] [OR:Bacillus subtilis] [DB :genpept-bctl] [EC:2.7.4.7] [DE:Bacillus 
subtilis complete genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate 
gene name: ywdB, ipa-52r] [SP:P39610] [LE:101359] [RE:102174] [DI:direct] 
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Description 

gp: [GI:g3025463] [LN: CAU58131] [AC:U58131] [PN:SigX] [GN:sigX] [OR : Clostridium 
acetobutylicum] [DB : genpept- bct2] [DE : Clostridium acetobutylicum pho-sigX gene region, 
phoP, phoR, sigX,orf36, and orfl8 genes, complete cds.] [NT:proposed ECF subfamily RNA 
polymerase sigmaf actor] [LE:3293] [RE: 3847] [DI: direct] 
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Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT.-ORFl] 

[LE:1101] [RE: 1922] [DI : complement] 
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Description 

sp: [LN:YAAJ_BACSU] [AC:P21335] [GN:YAAJ] [OR: BACILLUS SUBTILIS] [DE : HYPOTHETICAL 17.8 
KD PROTEIN IN SERS-DNAH INTERGENIC REGION] [SP:P21335] [DB : swissprot] >pir : [LN : S11690] 
[AC:S11690:S66048 :B69737 ] [PN : conserved hypothetical protein yaaJ] [GN:yaaJ ] 
[CL: hypothetical protein Yaaj] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl005796:g467408] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . 

subtilis DNA, 180 kilobase region of replication origin.] [LE: 89452] [RE: 89937] 

[DI: direct] >gp : [GI :g40011] [LN:BSORF17] [AC:X52144] [OR:Bacillus subtilis] 

[DB: genpept-bctl] [DE:Bacillus subtilis DNA for ORF17, small cytoplasmic RNA and 

partialdnaX gene.] [NT:ORF17 (AA 1-161)] [SP:P21335] [LE:129] [RE: 614] [DI:direct] 

>gp: [GI:ell81951:g2632285] [LN : BSUB0001] [AC : Z99104 : AL009126] [GN:yaaJ] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 

1 of 21) : from 1 to213080.] [NT: similar to hypothetical proteins] [SP:P21335] 
[LE:25850] [RE:26335] [DI: direct] 
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Description 

pir: [LN:A71042] [AC:A71042 ] [PN:probable mevalonate kinase] [GN:PH1625 ] 
[OR: Pyrococcus horikoshii] [DB:pir2] >gp: [GI :dl031680 :g3258054] [LN : AP000006] 
[AC:AP000006 : AB005215 : AB009510 : AB009511 : AB009512 :AB009513 :AB009514] [PN:335aa long 
hypothetical mevalonate kinase] [GN:PH1625] [OR : Pyrococcus horikoshii] [SR : Pyrococcus 
horikoshii (strain :OT3) DNA, clone : Pyrococcus horikoshi] [DB : genpept-bctl] 
[DE: Pyrococcus horikoshii OT3 genomic DNA, 1166001-1485000 nt . position (6/7 ). ] 
[NTrsimilar to owl:MTU47134 percent identity: 41.993 in] [LE:275590] [RE:276597] 
[DI : complement] 
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Description 

gp: [GI:g2897106] [LN : AF020798] [AC : AF020798] [PN : repressor] [OR : Streptococcus 
thermophilus bacteriophage TP-J34] [DB : genpept -phg] [DE : Streptococcus thermophilus 
bacteriophage lysogeny module, integrasehomolog (int) , putative host cell 
surface -exposed lipoprotein, putative metallo-proteinase, repressor, Cro-like 
regulatoryprotein, and Pl-antirepressor homolog genes, complete cds . ] [NT: CI -like 
regulatory protein; orfl21] [LE:4825] [RE: 5190] [DI : complement] 



ORF Name 



NTID 



^ Tr , NT AA 

AAID — — . score 

Length Length 



probability 



IAI7B03000987 14885260 c2 839 



[1475 | [ 5247 | p^T"| | 146 | p5~-|| b . 3e-17 "| 
Description 

gp: [GI:g2444132] [LN.-U88974] [AC:U88974] [PN:ORF25] [OR : Streptococcus thermophilus 
temperate bacteriophage O1205] [DB : genpept -phg] [DE : Streptococcus thermophilus 
temperate bacteriophage O1205, completegenome . 3 [NT:putative small subunit of the 
terminase] [LE: 13835] [RE: 14329] [DI:direct] 
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Description 

gp: [GI:gl458327] [LN : CELF08F3] [AC:U64847] [GN:F08F3.4] [OR : Caenorhabditis elegans] 
[SR:Caenorhabditis elegans strain=Bristol N2] [DB : genpept -invl] [DE : Caenorhabditis 
elegans cosmid F08F3.] [LE : 1515 : 1817 : 2044 : 2841] [RE : 1758 : 1994 : 2276 : 3110] 
[DI : direct Join] 



367 



ORF Name 



NTID 



NT 



AA 



AAID _ ^ „ 



LenqthLength 



score probability 



1A1 7 503000987 158929 32 c3 902 



[1478 | [5250 |[TT9 | |42 [ 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID _ . , . 



Length Length 



score 



E 



17503000987 16054827 c3 959 



psi |p37 — | [lie |pr- 



probability 
2.4e-14 



Description 

pir: [LN:T00183] [AC:T00183 ] [PN : hypothetical protein 50] [OR : Staphylococcus aureus 
phage phi PVL] [DB:pir3] >gp : [GI : dl032884 :g3341957] [LN : AB009866] [AC : AB009866] 
[OR: bacteriophage phi PVL] [SR : bacteriophage phi PVL (specif ic_host : Staphylococcus 
aureus ATC] [DB : genpept -phg] [DE : Bacteriophage phi PVL proviral DNA, complete 
sequence.] [NT:orf 50] [LE:36701] [RE:37069] [DIrdirect] 



ORF Name 



NT 



AA 



NTID AAID — . T — , , score probabili ty 
LengthLength J ~ 



U!7503000987 15212803 c2 844 



1480 



[5252 |pT3~||105 ||124 ||5.4e-08 



Description 

pir : [LN:S58144] [AC:S58144 ] [PN:gene 15 protein] [OR:phage SPP1] [DB:pir2] 

>gp: [GI :e244843 :g2764862] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 

[DB: genpept -phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NT: gene 15] 

[LE:9012] [RE:9320] [DI:direct] >gp : [GI : gl052813 ] [LN : SPP1HEADG] [AC:X89721] [GN:15] 

[OR: Bacteriophage SPP1] [DB : genpept -phg] [DE : Bacteriophage SPP1 head morphogenesis 

genes 7 to 15.] [NT:product required for head morphogenesis] [LE:5788] [RE: 6096] 
[DI : direct] 



ORF Name 



U7503000987 16413130 £1 128 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 
"] [1481 | [ 5253 || 245 ~| |81 | p^~||l . 8e-08 



Description 

gp: [GI:g3929312] [LN : AF100426] [AC : AF100426] [PN : f imbriae-associated protein Fapl] 
[GN:fapl] [OR: Streptococcus parasanguinis] [DB : genpept -bct2] [DE : Streptococcus 
parasanguis f imbriae-associated protein Fapl (fapl) gene, complete cds . ] [NT:invovled in 
fimbriae assembly and f imbriae-mediated] [LE:284] [RE: 7996] [DI: direct] 



ORF Name 



1AI7503000987 15503427 c2 851 



NT AA 
NTID AAID — , — , score 
LengthLength 

[1482 | [5254 ||S3i | [175 | 



probability 



Description 
NO-HIT 

ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



117503000987 15581587 cl 535 



TT8T 



|5255 |preg~| |342 ||594 | [ 



probability 
|2.1e-58 



Description 

sp: [LN:YACI_BACSU] [AC:P37570] [GN : YACI] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 41 . 1 
KD PROTEIN IN LYSS-MECB INTERGENIC REGION (ORFX) ] [SP:P37570] [DB : swissprot] 
>pir: [LN:S66114] [AC : S66114 : 140507 :B69741 ] [PN:creatine kinase homolog yacl] [GN:yacI 
] [OR:Bacillus subtilis] [DB:pir2] >gp: [GI : dl005861 : g467473] [LN:BAC180K] [AC:D26185] 
[PN:unknown] [OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, 
strain: 168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of 
replication origin.] [LE:166083] [RE:167174] [DI:direct] >gp : [GI : ell82018 :g2632352] 
[LN:BSUB0001] [AC : Z99104 : AL009126] [GN:yacI] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 1 of 21) : from 1 
to213080.] [NT:similar to creatine kinase] [SP:P37570] [LE:102481] [RE:103572] 
[DI:direct] 



368 



ORF Name 



NTID 



NT AA 
AAID — *^ — . . score 
LengthLength 



F 



ivbOiooosuv wmwm ci £,46 



][ 



TT8T" 



[ 3558 ||TTF5— | |b906 | [ 



probability 
T03 



Description 

sp: [LN:RPOB_STAAU] [AC:P47768] [GN:RPOB] [OR : STAPHYLOCOCCUS AUREUS] [EC:2.7.7.6] 
[DE : BETA CHAIN) (RNA POLYMERASE BETA SUBUNIT) ] [SP:P47768] [DB : swissprot ] 
>pir: [LN:S59951] [AC:S59951 ] [PN : DNA-directed RNA polymerase, beta chain] [GNirpoB ] 
[CL:DNA-directed RNA polymerase beta chain] [OR : Staphylococcus aureus] [EC: 2. 7. 7. 6] 
[DB:pir2] >gp: [GI :g677851] [LN : SARPLRPO] [AC:X64172] [PN : DNA-directed RNA polymerase 
beta chain] [GN : rpoB] [OR : Staphylococcus aureus] [DB :genpept-bctl] [EC:2.7.7.6] 
[DE:S. aureus rplL, orf202, rpoB(rif) and rpoC genes for ribosomalprotein L7/L12, 
hypothetical protein ORF202, DNA-directed RNApolymerase beta & beta' chains.] 
[SP:P47768] [LE:1222] [RE:4770] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID , . 



LengthLength 



score probability 



IAI7503000987 16972575 tl 196 



] [1485 | [5257 ||Igg~| |61 | [76 ||0 , 036 



Description 

gp: [GI:g4580755] [LN : AF061085] [AC : AF061085] [PN : P-glycoprotein] [OR : Gossypium 
herbaceum] [DB : genpept -pln2 ] [DE : Gossypium herbaceum P-glycoprotein gene, partial cds . ] 
[NT: similar to P-glycoprotein in Hordeum vulgare and] [LE:<22] [RE:>894] [DI: direct] 



ORF Name 



NT 

NTID AAID _ 



AA 



LengthLength 



score 



117503000987 1702 t2 209 



] [I486 | [5258 [T7T 



probability 
0.0089 



Description 

pir: [LN:T00168] [AC:T00168 ] [PN : hypothetical protein 33] [OR : Staphylococcus aureus 
phage phi PVL] [DB:pir3] >gp : [GI : dl032869 :g3341942] [LN: AB009866] [AC : AB009866] 
[OR: bacteriophage phi PVL] [SR : bacteriophage phi PVL (specif ic_host : Staphylococcus 
aureus ATC] [DB :genpept-phg] [DE : Bacteriophage phi PVL proviral DNA, complete 
sequence.] [NT:orf 33] [LE:28172] [RE:28582] [DI : complement] 



ORF Name 



NTID AAID . — 



1487 



5259 



NT AA 
LengthLength 



score 



probability 
b.3e-65 



|AI7503000987_19690876_c3_935 

Description 

sp: [LN:YWFL_BACSU] [AC:P39648] [GN: YWFL: IPA-90D] [OR:BACILLUS SUBTILIS] 
[DE: HYPOTHETICAL 31.4 KD PROTEIN IN PTA 3' REGION] [SP:P39648] [DB : swissprot] 
>pir: [LN:S39745] [AC : S3 9 74 5 :D70056 ] [PN:ywfL protein : hypothetical protein ipa-90d] 
[GN:ywfL ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : g414014] [LN : BSGENR] [AC:X73124] 
[GN:ipa-90d] [OR:Bacillus subtilis] [DB:genpept-bctl] [DE : B . subtilis genomic region 
(325 to 333).] [SP:P39648] [LE:93300] [RE:94145] [DI: direct] >gp : [GI : ell86264 :g2636300] 
[LN:BSUB0020] [AC : Z99123 : AL009126 ] [GN:ywf L] [FN:unknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 20 of 21) : from 
3798401to 4010550.] [NT : alternate gene name: ipa-90d] [SP:P39648] [LE:63814] [RE:64659] 
[DI : complement] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



^7503000987 19695366 c3 953 



1488 



[5260 |[771~]p56 ~| pr 



IT" 



probability 
0.00059 



Description 

pir : [LN:T00180] [AC:T00180 ] [PN : hypothetical protein 46] [OR : Staphylococcus aureus 
phage phi PVL] [DB:pir3] >gp : [GI : dl032881 :g3341954] [LN : AB009866] [AC : AB009866] 
[OR bacteriophage phi PVL] [SR : bacteriophage phi PVL (specif ic_host : Staphylococcus 
aureus ATC] [DB : genpept-phg] [DE : Bacteriophage phi PVL proviral DNA, complete 
sequence.] [NT:orf 46] [LE:35157] [RE:36050] [DI:direct] 



369 



ORF Name 



NTID 



NT AA 
AAID — _ — . , score 
LengthLength 



E 



17503000987 20078287 ±3 533 



probability 
1489 | [5251 |pB~]| 274 | [ 14b | |2.5e-07 



Description 

pir : [LN:B70798] [AC:B70798 ] [PN:probable membrane protein] [GN:Rv3737 ] 
[OR: Mycobacterium tuberculosis] [DB:pir2] >gp : [GI : el264597 :g2960161] [LN:MTV025] 
[AC:AL022121:AL123456] [PN : hypothetical protein Rv3737] [GN:Rv3737] [OR : Mycobacterium 

tuberculosis] [DB :genpept-bctl] [DE : Mycobacterium tuberculosis H37Rv complete genome; 

segment 155/162.] [NT:Rv3737, (MTV025 . 085) , len: 529. Probable membrane] [LE:94796] 
[RE:96385] [DI:direct] >gp : [GI :el264597:g2 960161] [LN:MTV025] [AC : AL022121 : AL123456] 
[PN: hypothetical protein Rv3737] [GN:Rv3737] [OR Mycobacterium tuberculosis] 
[DB:genpept] [DE Mycobacterium tuberculosis H37Rv complete genome; segment 155/162.] 
[NT:Rv3737, (MTV025 . 085 ) , len: 529. Probable membrane] [LE:94796] [RE:96385] 
[DI : direct] 



ORF Name 



NTID 



7V7VTr , NT AA 

AAID — , _ — , . score 

LengthLength 



probability 



^7503000387 20087752 £2 352 



1490 | [5252 ||r5Pg"||S0i | pT7~||2 . 5e-36 



Description 

sp: [LN:TAGE_BACSU] [AC:P13484] [GN : TAGE : RODD : GTAA] [OR:BACILLUS SUBTILIS] [EC : 2 . 4 . 1 . 52] 
[DE:(EC 2.4.1.52) (TEICHOIC ACID BIOSYNTHESIS PROTEIN E) ] [SP:P13484] [DB : swissprot] 
>pir: [LN:S06048] [AC : S06048 : F6972 0 ] [PN : poly (glycerol -phosphate) 
alpha-glucosyltransf erase, tagE : probable rodD protein :UDP-glucose- -polyglycerol 
phosphate glucosyltransf erase tagE] [GN : tagE : rodD ] [OR: Bacillus subtilis] [EC: 
2.4.1.52] [DB:pir2] [MP: 310 degrees ] >gp : [GI:g580920] [LN : BSRODC] [AC:X15200] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis rodC operon.] [NT : rodD 
(gtaA) polypeptide (AA 1-673)] [SP:P13484] [LE:157] [RE:2178] [DI:direct] 
>gp: [GI:ell84479:g2636099] [LN: BSUB0019] [AC : Z99122 : AL009126] 

[PN:UDP-glucose : polyglycerol phosphate] [GN:tagE] [FN:teichoic acid biosynthesis] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 4 . 1 . 52] [DE:Bacillus subtilis complete 

genome (section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodD, 

gtaA, gtaD] [SP:P13484] [LE : 80369] [RE: 82390] [DI : complement] 

>gp: [GI:ell84479:g2636099] [LN : BSUB0019] [AC : Z99122 : AL009126] 

[PN:UDP-glucose :polyglycerol phosphate] [GN:tagE] [FN:teichoic acid biosynthesis] 
[OR:Bacillus subtilis] [DB:genpept] [EC : 2 . 4 . 1 . 52] [DE:Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT : alternate gene name: rodD, gtaA, gtaD] 
[SP:P13484] [LE:80369] [RE:82390] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — _^ — , - score 
LengthLength 



probability 



[AI7503000987 20322153 c2 818 



] [1491 | [5263 ||2T5— 1|72 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — . T — .score 
LengthLength 



IAI7503000987 20331552 cl 659 



J 



5264 11996 I \Ttt 



probability 
3.2e-42 



Description 

pir : [LN:C69066] [AC:C69066 ] [PN : ornithine cyclodeaminase] [GN:MTH1495 ] 
[OR:Methanobacterium thermoautotrophicum] [DB:pir2] >gp : [GI :g2622612] [LN: AE000910] 
[AC:AE000910 :AE000666] [PN : ornithine cyclodeaminase] [GN:MTH1495] [OR : Methanobacterium 
thermoautotrophicum] [DB :genpept-bct2] [DE : Methanobacterium thermoautotrophicum from 
bases 1349621 to 1362200 (section 116 of 148) of the complete genome.] [NT:Function 
Code: 5. 09 - L-Amino Acid Metabolism,] [LE:963] [RE: 1982] [DI : complement] 



370 



NT AA 

ORF Name NT ID AAID Le ^ thLe ^ t h scor e probability 



[A17S03000987J>0bllb90_c3_B93 | [1493 | [ 5255 ||3fl4 | [127 |[" 414 || 1.0e-iB | 

Description 

sp: [LN:RL7_MICLU] [AC:P02395] [GN : RPLL] [OR : MICROCOCCUS LUTEUS] [SR :, MICROCOCCUS 
LYSODEIKTICUS] [DE:50S RIBOSOMAL PROTEIN L7/L12 (MA1/MA2 ) ] [SP:P02395] [DB : swissprot] 
>pir: [LN:R7MCML] [AC:A02771 ] [PN : ribosomal protein L7/L12 : ribosomal protein MA] 
[CL: Escherichia coli ribosomal protein L12] [OR : Micrococcus luteus : Micrococcus 
lysodeikticus] [DBipirl] 

NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 

[AI7503000987_2075S26u_ci_729 | [1494 | [52 66 || 40Q ~| |pT-] p.4e-i4 | 

Description 

pir: [LN:T00194] [AC:T00194 ] [PN : hypothetical protein 61] [OR : Staphylococcus aureus 
phage phi PVL] [DB:pir3] >gp : [GI : dl032895 :g3341968] [LN: ABO 098 66] [AC : AB009866] 
[OR: bacteriophage phi PVL] [SR : bacteriophage phi PVL (specif ic_host : Staphylococcus 
aureus ATC] [DB : genpept -phg] [DE : Bacteriophage phi PVL proviral DNA, complete 
sequence.] [NT:orf 61] [LE:39932] [RE:40402] [DIrdirect] 

NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 

[A17S030005fl7_209S0262_c2_762 | [1495 | [ 5257 | p7T] |823 | pF^]| 0 . 0 "| 

Description 

sp : [LN : MECB_BACSU] [AC:P37571] [GN : MECB : CLPC] [OR:BACILLUS SUBTILIS] [DE : NEGATIVE 
REGULATOR OF GENETIC COMPETENCE MECB] [SP:P37571] [DB : swissprot] >pir : [LN : 14 0508] 
[AC:I40508:S66115:I40385:H69600 ] [PN:class III stress response- related ATPase 
clpC: adenosine triphosphatase clpC : clpA/clpB protein homolog] [GN: clpC :mecB ] 
[CL: ATP -dependent Clp proteinase chain A] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl005862:g467474] [LN:BAC180K] [AC:D26185] [PN : clpA/clpB family] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] 
[DB:genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:167171] [RE:169603] [DI:direct] >gp : [GI : g442360] [LN : BSU02604] [AC:U02604] [PNrClpC 
adenosine triphosphatase] [GN:mecB] [FN : competence gene repressor; required for cell] 
[OR: Bacillus subtilis] [DB : genpept -bctl] [DE: Bacillus subtilis Marburg 168 ClpC 
adenosine triphosphatase (mecB)gene, complete cds, orfX and orfY, partial cds.] 
[LE:335] [RE:2767] [DI:direct] >gp : [GI : el 182019 :g2 632353] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [PN:class III stress response-related ATPase] [GN:clpC] 
[FN: negative regulator of late competence genes;] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 1 of 21) : from 1 
to213080.] [NT:alternate gene name: mecB] [SP:P37571] [LE:103569] [RE:106001] 
[DI :direct] 



NT AA 

ORF Name NT ID AAID — — , , score probability 
Length Length 

|AI7S030009B7_2117202_c2_83i | [1496 | [ 5268 |[T77— | p | 

Description 

NO-HIT 

NT AA 

ORF Name NTID AAID — _ — , score probability 
Length Length 

A17!>030009flV_214B446B_c2_84B | [1497 | [ 5259 |p^~[ | 171 | p || 0.019 



Description 

gp: [GI:g3702331] [LN : ATAC0053 97] [AC : AC0053 97] [GN : T3F17 . 17] [OR : Arabidopsis thaliana] 
[SR:thale cress] [DB : genpept -pln2] [DE : Arabidopsis thaliana chromosome II BAC T3F17 

genomic sequence , complete sequence.] [NT : hypothetical protein] 
[LE: 52308 : 52593 : 52780 : 53023] [RE : 52505 : 52691 : 52902 : 53157] [DI : direct Join] 



371 



ORF Name 



NT ID 



NT AA 
AAID „ — _ T — , . score 
LengthLength 



1 7 503000967 21b988J8 r3 534 



Description 

sp: [ LN : Y J JP_HAE IN] [AC:P4452 
PROTEIN HI0108] [SP:P44520] 
[PN: hypothetical protein HIO 
>gp: [GI:gl573061] [LN:U32696 
[GN : HI 0 1 0 8 ] [ OR : Haemophi lus 
Rd section 11 of 163 of the 
PID:537207 GB:U00096] [LE:40 



[1498 | [5270 |pTS3 — | [lb4 [[Im- 



probability 
7.2e-li 



0] [GN:HI0108] [OR : HAEMOPHILUS INFLUENZAE] [DE : HYPOTHETICAL 

[DB:Swissprot] >pir : [LN: 164142] [AC:I64142 ] 

108] [OR Haemophilus influenzae] [DB:pir2] 

] [AC:U32696 :L42023] [PN : conserved hypothetical protein] 

influenzae Rd] [DB :genpept-bct2] [DE : Haemophilus influenzae 

complete genome.] [NT: similar to GB:U14003 SP:P39402 

95] [RE.-4988] [DI:direct] 



ORF Name 



^17503000987 21603777 tl 144 



Description 
NO-HIT 



NT 



AA 



NT ID AAID Le ~ thLe ~ th score probability 



[5271 | [41 | 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



\I7503000987 21758468 C3 992 



[1500 | [5272 |pnrg — I | 162 | |TTr~| [6 . 9e-08 — | 
Description 

pir: [LN:B70351] [AC:B70351 ] [PN : ribosomal -protein-alanine acetyltransf erase] [GN:rimI 
] [CL: Escherichia coli peptide N -acetyl transferase riml] [OR:Aquifex aeolicus] 
[DB:pir2] >gp: [GI : g2983204] [LN : AE000696] [AC : AE000696 : AE000657] 

[PN:ribosomal-protein-alanine acetyltransf erase] [GN:rimI] [OR:Aquifex aeolicus] 
[DB :genpept-bct2] [DE:Aquifex aeolicus section 28 of 109 of the complete genome.] 
[LE:3558] [RE: 4022] [DI : complement] 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



7503000987 22042128 t3 519 



Description 
NO-HIT 



[5273 |p3T? | p~ 



ORF Name 



IA17503000987 22069160 cl 652 



NT ID 



NT 



AA 



AAID 

LengthLength 



score 



[5274 |p3~ | [130 |pT 



probability 
7.1e-6i 



Description 

sp: [LN:RS12_STAAU] [AC:P48942] [GN: RPSL] [OR : STAPHYLOCOCCUS AUREUS] [DE:30S RIBOSOMAL 
PROTEIN S 12] [SP:P48942] [DB : swissprot] >gp : [GI :g706921] [LN: SAU20869] [AC:U20869] 
[PN:ribosomal protein S12] [GN : rpsL] [OR : Staphylococcus aureus] [DB : genpept-bct2] 
[DE staphylococcus aureus ribosomal protein S12 (rpsL) gene, completecds, ribosomal 
protein S7 (rpsG) and ORF 1 genes, partial cds . ] [LE:418] [RE: 831] [DI: direct] 



ORF Name 

|AI7503000987J>2ii036^t2_414 

Description 

NO-HIT 



NT AA 
NT ID AAID — _ „ — _ score 
Length Length 

1503 | [5275 ||174 | [57 I 



probability 



ORF Name 



|Ar;503000987_22272200_ciJ735- 

Description 
NO-HIT 



NT ID AAID — _ , — , , score 
LengthLength 

[1504 I |5276 |pT9 | |312 | 



probability 



372 



ORF Name 



NTID 



NT AA 
AAID — _ T — , , score 
LengthLength 



117503000987 22400^61 c2 843 



J [ 1505 | [5277 ||B34 | |277 | pi" 



probability 
.9e-62 



Description 

gp: [GI :el39438 :gl369939] [LN:BTP9011] [AC:X84706] [PN:major head protein] [GN:mhp] 
[OR: Bacteriophage Bl] [DB :genpept-phg] [DE : Bacteriophage TP901-1 genomic region.] 
[LE:360] [RE: 1181] [DI: direct] 



ORF Name 



IAI7503000987 22539010 t3 603 



NT AA 
NTID AAID — » — , - score 
LengthLength 

[1506 | [5278 ||^~| |42 | 



probability 



Description 
NO-HIT 

ORF Name 



AA 



NT 

NTID AAID — . T — , , scor e 
LengthLength 



probability 



^17503000987 22539186 FT 67 



] [1507 | [5279 |pg— |p" 



Description 
NO-HIT 

ORF Name 



NTID AAID 



VI7503000987 22689067 ti 205 



NT AA 
LengthLength 
T77T 



score probability 



1218 ||6.3e-124 



Description 

sp: [LN:NUPC_BACSU] [AC:P39141] [GN:NUPC] [OR:BACILLUS SUBTILIS] [DE : PYRIMIDINE 
NUCLEOSIDE TRANSPORT PROTEIN] [SP:P39141] [DB : swissprot ] >gp : [GI : dl008934 :gl408507] 
[LN:D45912] [AC:D45912] [PN : pyrimidine nucleoside transport protein] [GN:nupC] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:BGSC 1A1 (Marburg 168; trpC2) ) 
DNA] [DB :genpept-bctl] [DE: Bacillus subtilis genome sequence between the iol and hut 
operon, partial and complete cds . ] [LE:20443] [RE:21624] [DI: direct] 



J 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IA27503000987 22694377 c2 776 



] [1509 | [5281 |[T¥T~||246 | |7g2~||i . 3e-75 



] 



Description 

pir: [LN:S59955] [AC:S59955 ] [PN : hypothetical protein 202] [CL : hypothetical protein 
MJ0882] [OR: Staphylococcus aureus] [DB:pir2] >gp : [GI : g677850] [LN : SARPLRPO] [AC:X64172] 
[PN: hypothetical protein] [GN:ORF202] [OR : Staphylococcus aureus] [DB :genpept-bctl] 
[DE:S. aureus rplL, orf202, rpoB(rif) and rpoC genes for ribosomalprotein L7/L12, 
hypothetical protein ORF202, DNA-directed RNApolymerase beta & beta' chains.] [LE:399] 
[RE: 1007] [DI: direct] 



ORF Name 



NTID 



7v7vTr ^ NT AA 

AAID — — score 

LengthLength 



probability 



|AI7503000987_22773302_cl_741 



J [1510 | [5282 |JTgTTT]| 600 | [ 1371 || 3 . 9e-140 



Description 

gp: [GI:e286568:g2764983] [LN : BP187PLYH] [AC:Y07740] [PN:cell wall hydrolase Plyl87] 
[GN:plyl87] [OR staphylococcus phage 187] [DB :genpept-phg] [DE : Staphylococcus phage 187 
plyl87 and holl87 genes.] [LE:222] [RE: 2108] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID — ^.^ — . . score pro bability 
LengtnLengtn 



^17503000987 22790941 c2 847 



1511 | [ 5283 [pi - 1| 180 | p^3—| |5.2e-10 



Description 

gp: [GI:e244714:g2764866] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB:genpept-phg] [DE bacteriophage SPP1 complete nucleotide sequence.] [NT:gene 17.1] 
[LE:10481] [RE:11014] [DI:direct] 



373 



ORF Name 

|AI 7 b03000987 2344213b cl 685 



NT ID 



NT 



AA 



AAID . _ v 

LengthLength 



score probability 



] | 1512 | I 5284 || 5 7 5 1 1224 | pg— 1|7 . 9e-ii 



Description 

pir: [LN:F71309] [AC:F71309 ] [PN : probable phosphoglycolate phosphatase (gph-2)] 
[GN:TP0554 ] [OR : Treponema pallidum subsp. pallidum] [SR:, syphilis spirochete] 
[DB:pir2] >gp : [GI :g3322848] [LN: AE001231] [AC : AE001231 : AE000520] [PN : phosphoglycolate 

phosphatase (gph-2)] [GN:TP0554] [OR : Treponema pallidum] [DB : genpept-bct2] 
[DE : Treponema pallidum section 47 of 87 of the complete genome.] [NT: similar to 

GB:L42023 SP:P44755 PID:1004013] [LE:1483] [RE:2151] [DI: direct] 



ORF Name 



NT ID 



NT 



AA 



AAID . — , — 



LengthLength 



score probability 



AI7503000987_23469213_c2_838 

Description 

NO-HIT 



1513 | [5285 ||4ii ~j | 



ORF Name 



NT AA 
NT ID AAID — . — . , score 
LengthLength 



probability 



IAI7S0300098 7 23477213 cl 701 



] [1514 | [5285 |[r0TT]|336 |pTB5"||7 ■ 9e-110 



Description 

pir : [LN:S39743] [AC : S39743 : D69683 ] [PN : phosphotransacetylase pta] [GNrpta ] 
[CL : phosphate acetyltransf erase] [ORrBacillus subtilis] [DB:pir2] >gp : [GI :g580883] 
[LN: BSGENR] [AC:X73124] [GN:ipa-88d] [OR:Bacillus subtilis] [DB : genpept-bct 1] 
[DE:B. subtilis genomic region (325 to 333).] [SP:P39646] [LE:91234] [RE:92205] 
[DI:direct] >gp : [GI :ell86266:g2 636302] [LN:BSUB0020] [AC : Z99123 : AL009126] 
[PN: phosphotransacetylase] [GN:pta] [OR:Bacillus subtilis] [DB : genpept -bctl ] 
[EC:2.3.1.8] [DE:Bacillus subtilis complete genome (section 20 of 21): from 3798401to 

4010550.] [NT:alternate gene name: ipa-88d, ywfj] [SP:P39646] [LE:65754] [RE:66725] 
[DI : complement] 



ORF Name 



|KT7 



b03000987 23i>M700 c3 1002 



Description 
NO-HIT 

ORF Name 



_ _ NT AA 

NT ID AAID — — score 

Length Length 

[1515 | [5287 ||162 1 153 | 



probability 



Al 7 b03000987_23572178_c3_S5fl 



__ 

NT ID AAID — _ — , . score 
LengthLength 



probability 



Description 
NO-HIT 



] I 1515 1 pes |p | 



ORF Name 



NTID 



|A17b03000987 23601637 c2 794 



NT AA 

AAID — 

Length Length 



score 



\T7T 



probability 
1.7e-34 



Description 

pir: [LN:S60902] [AC : S60902 : S49238 : S44071 ] [PN : CDP-ribitol pyrophosphorylase] 
[OR Haemophilus influenzae] [DB:pir2] >gp : [GI :g471234] [LN: HISBCAL] [AC:X78559] 
[OR : Haemophilus influenzae] [DB : genpept-bctl] [DE :H . influenzae DNA for serotype b 

capsulation locus.] [NT:orfl] [LE:434] [RE: 1858] [DI: direct] 



374 



ORF Name 



NTID 



NT AA 
AAID — , — score 
LenqthLength 



IA17b03000987 2J60JJ91 c3 969 



] [1518 | [5290 |[ZW||97 |pT" 



probability 
10.00030 



Description 

gp: [GI:el389970:g4539393] [LN : ATF28A21] [AC : AL035526] [PN : hypothetical protein] 
[GN:F28A21 . 150] [OR : Arabidopsis thaliana] [SR:thale cress] [DB : genpept-plnl] 
[DE : Arabidopsis thaliana DNA chromosome 4, BAC clone F28A21 (ESSAproject) . ] 
[LE : 62401: 62755 : 63247] [RE : 62693 : 63126 : 6333 7] [DI : complement Join] 



ORF Name 



NT 

NTID AAID _ — 



AA 



Length Length 



score 



E 



17503000987 23617140 Cl 686 



1515 | p91 |pUT][4S7 |pT- 



probability 
6 . Oe-44 



Description 

pir: [LN:A69832] [AC:A69832 ] [PN : long- chain f atty-acid-CoA ligase homolog yhfT] 
[GNryhfT ] [CL: acetate- -CoA ligase homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell83038:g2633372] [LN : BSUB0006] [AC : Z99109 : AL009126] [GN:yhfT] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
6 of 21): from 999501 tol209940.] [NT:similar to long-chain f atty-acid-CoA ligase] 
[LE:110451] [RE:111890] [DI : complement] >gp : [GI : e324999 :g2226256] [LN:BSY14084] 
[AC:Y14084] [PN : hypothetical protein] [GN:yhfT] [OR:Bacillus subtilis] 

[DB : genpept-bctl] [DE :B . subtilis chromosomal DNA, region 78-80 degrees: aprE to comK.] 
[NT similarity to long-chain-acyl-CoA synthetase from] [LE:4879] [RE: 6318] 
[DI : complement] 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



1M7S03000987 23673313 F2 408 



] [1520 | [ 5292 | [64 | | 0.006i 



Description 

gp: [GI:g3329651] [LN: CELT17A3] [AC : AF078787] [GN:T17A3.9] [OR : Caenorhabditis elegans] 

[DB:genpept-inv2] [DE : Caenorhabditis elegans cosmid T17A3.] [LE : 15875 : 16120 : 16852] 
[RE : 16060 : 16380 : 17211] [DI : complement Join] 



ORF Name 



U7b03000987 23709631 cl 675 



Description 
NO-HIT 



NTID 



AA 



NT 

AAID — — . . score 
Length Length - 



probability 



[5293 



\m I 



[TO" 



ORF Name 



NTID 



^ m NT AA 

AAID — , — , , score 

LengthLength 



IAI7503000987 23712830 cl 713 



] [1522 | [5294 |[^~| [162 | pTT 



probability 
6.6e-19 



Description 

gp: [GI:g4049992] [LN: AF077306] [AC : AF0773 06] [PN:gpl57] [OR : Streptococcus thermophilus 
bacteriophage Sfil9] [DB : genpept -phg] [DE : Streptococcus thermophilus bacteriophage 
Sfil9 gpl57, gp23 3 , putative helicase , gpl51, gp271, putative primase, and gpl43genes, 
complete cds . ] [NT:orfl57] [LE:1] [RE:474] [DI .-direct] 



NT AA 

ORF Name NTID AAID — _ „ — _ score probability 
LengthLength c 

AI7503000987_23867125_t2_333 | [1523 | [ 5295 | pTT^ — | | 6 7 | p || 0.018 

Description 

pir : [LN:S21443] [AC:S21443 ] [PN : hypothetical protein] [OR : Dictyostelium discoideum] 
[DB:pir2] 



375 



ORF Name 



U7503000987 23910052 t3 585 



NTID AAID — , — score probability 



NT AA 
LengthLength : 
1524 | [5295 ||147 | [55" 



F — I 



Description 
NO-HIT 



ORF Name 



NTID 



U7503000987 2392837 cl 732 



1525 I [5^7 



NT AA 
LengthLength 
VIZ 



score probability 
226 ||S.4e-19 



Description 

gp: [GI:e244844:g2764864] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 

[DB :genpept-phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NT:gene 16.1] 
[LE:9632] [RE:10066] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — « — score 
LengthLength 



probability 



VI7503000987 24026077 c3 966 



J [1526 | [ 5298 \\^~\ | 317 | |55"0 [ [3 . 4e-54 " 



Description 

pir: [LN:S58137] [AC : S5813 7 : S244 56 ] [PN:gene 7 protein] [OR:phage SPP1] [DB:pir2] 
>gp: [GI:e244838:g2764848] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB : genpept -phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NT: gene 7] 
[LE:3802] [RE:4728] [DIrdirect] >gp : [GI : gl0528 06] [LN : SPP1HEADG] [AC:X89721] [GN:7] 
[OR: Bacteriophage SPP1] [DB : genpept -phg] [DE : Bacteriophage SPP1 head morphogenesis 

genes 7 to 15.] [NT:product required for head morphogenesis] [LE:578] [RE: 1504] 
[DI : direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



F 



7503000987 24026576 c2 854 



T5TT 



Description 
NO-HIT 



[5299 ||180 | 



ORF Name 



NT 



AA 



NTID AAID — — , , score probability 
LengthLength c 



^17503000987 24229837 ci 716 



[5300 [pi - 1|80 ||97 ||3.9e-05 



Description 

pir: [LN:A56273] [AC:A56273 ] [PN : hypothetical protein (bacteriocin saiA 3 1 -region) ] 
[OR Lactobacillus sake] [DB:pir2] >gp : [GI :g599850] [LN: LSSAKACLU] [AC:Z46867] [GN:orfl] 
[OR : Lactobacillus sakei] [DB : genpept -bctl] [DE:L.sake sakacin A gene cluster.] [LE:5] 
[RE : 268] [DI : complement] 



ORF Name 



NT 



AA 



NTID AAID , , 

LengthLength 



score 



probability 



IAI7503000987 24258462 c3 982 



] [1529 | [5301 |p5~ ||lil [ pTu~||4 . 8e-TT" 



Description 

gp: [GI:g928831] [LN : BK5TATTP] [AC:L44593] [FN : unidentified] [OR : Lactococcus lactis 
phage BK5-T] [SR : Bacteriophage BK5-T DNA] [DB : genpept -phg] [DE : Bacteriophage BK5-T 
ORF' 410, 3' end pf cds, 20 ORFs, repressorprotein, and Cro repressor protein genes, 
complete cds, ORF70'gene, 5' end of cds.] [NT:ORF95; putative] [LE:7452] [RE:7739] 
[DI : direct] 



ORF Name 



NT AA 
NTID AAID — t — _ score 
LengthLength 



probability 



E 



17503000987 24275342 cl 718 



] [1530 | [5302 ||T73— 1|125 | 



Description 
NO-HIT 



376 



ORF Name 

IA17503000987 24J202i>6 tl 134 



NT AA 
NT ID AAID — « _ — , , score 
LengthLength 



probability 



Description 
NO-HIT 

ORF Name 



NTID 



117503000987 24325127 13 452 



][ 



HIT 



NT AA 

AAID 

LengthLength 

[5304 | [SIR | [S7 



score 



T5T" 



probability 
|2.6e-10 



Description 

sp: [LN:YKGC_ECOLI] [AC:P77212] [GN: YKGC] [OR : ESCHERICHIA COLI] [DE : INTERGENIC REGION] 
[SP:P77212] [DB : swissprot] >pir : [LN : H64 756] [AC:H64756 ] [PN:probable mercury (II) 
reductase, :ykgC protein] [GNrykgC ] [OR : Escherichia coli] [EC : 1 . 16 . 1 . 1] [DB:pir2] 
>gp: [GI:gl657503] [LN : ECU73857] [AC:U73857] [OR : Escherichia coli] [DB : genpept-bctl] 
[DE: Escherichia coli chromosome minutes 6-8.] [NT: similar to S. aureus mercury (II) 
reductase] [LE:26473] [RE:27825] [DI : complement] >gp : [GI :gl786495] [LN: AE000137] 
[AC:AE000137 :U00096] [PN:putative oxidoreductase] [GN:ykgC] [FN:putative enzyme; Not 
classified] [OR : Escherichia coli] [DB : genpept-bct2] [DE Escherichia coli K-12 MG1655 
section 27 of 400 of the completegenome . ] [NT:f450; 35 pet identical (29 gaps) to 430 
residues of] [LE:6292] [RE: 7644] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000987 24337800 c2 862 



[5305 |p55-||46i [[945 ||5.4e-95 



Description 

pir: [LN:T00158] [AC:T00158 ] [PN : amidase , ] [OR : Staphylococcus aureus phage phi PVL] 
[EC:3.5.-.-] [DB:pir3] >gp: [GI :dl032859 :g3341932] [LN: AB009866] [AC : AB009866] 
[PN:amidase (peptidoglycan hydrolase)] [OR : bacteriophage phi PVL] [SR : bacteriophage phi 
PVL (specif icjiost staphylococcus aureus ATC] [DB :genpept-phg] [DE bacteriophage phi 
PVL proviral DNA, complete sequence.] [NT:orf 25] [LE:20199] [RE:21653] [DI:direct] 



ORF Name 



NTID AAID 



F 



17503000987 2438878 tl 126 



NT AA 
LengthLength 
[5306 [717 



score probability 
1288 ||2.3e-25 



Description 

sp: [LN : ARSC_STAAU] [AC:P30330] [GN: ARSC] [OR : STAPHYLOCOCCUS AUREUS] [DE : ARSENATE 
REDUCTASE (ARSENICAL PUMP MODIFIER) ] [SP:P30330] [DB : swissprot] >pir: [LN:D41903] 

[AC:D41903 ] [PN:arsenate reductase,] [GN:arsC ] [CL : protein- tyrosine-phosphatase, low 
molecular weight] [OR : Staphylococcus aureus] [EC:1. -.-.-] [DB:pirl] >gp : [GI :gl50729] 

[LN:PI2ARSRBC] [AC:M86824] [PN:arsenate reductase] [GN:arsC] [FN deduction of arsenate 
to arsenite] [OR:Plasmid pI258] [SR:Plasmid pI258 DNA] [DB : genpept-bctl] [DE:Plasmid 
pI258 arsenic resistance operon (arsRBC) genes, completecds . ] [LE:1894] [RE: 2289] 

[DI .-direct] 



377 



NT AA 

QRF Name ™ ™ Le ^thLe^th ^^ probability 



|A17b03000987_24401462_c3_921 | [ 1535 | [ 5307 [pi" [ | 196 | p^7~ || 9 . 6e-34 

Description 

Sp: [LN : YCKF_BACSU] [AC:P42404] [GN:YCKF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 20.0 
KD PROTEIN IN TLPC-SRFAA INTERGENIC REGION (ORF9) ] [SP:P42404] [DB : swissprot] 
>pir : [LN:H69760] [AC:H69760 ] [PN : conserved hypothetical protein yckF] [GN:yckF ] 
[CL: conserved hypothetical protein MJ1247] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI :dl007003 :gl438846] [LN : BACYCK] [AC:D30762] [PNiunknown] [GN : yckF] [OR:Bacillus 
subtilis] [SR: Bacillus subtilis (strain : 168trpC2 ) DNA, clone_lib : lambda DASHI] 
[DB:genpept-bctl] [DE: Bacillus subtilis DNA around 28 degrees region of 
chromosomecontaining yckA-H genes.] [LE:7448] [RE: 8005] [D I : complement] 
>gp: [GI:ell82297:g2632631] [LN:BSUB0002] [AC : Z99105 : AL009126] [GN:yckF] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
2 of 21): from 194651 to415810.] [NT:similar to hypothetical proteins] [LE:179524] 
[RE:180081] [DI : complement ] >gp : [GI :dl009614 :gl805417] [LN:D50453] [AC:D50453] 
[GN : yckF] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain: 168 trpC2) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis DNA for 25-36 degree region containing 
theamyE-srfA region, complete cds . ] [LE: 53828] [RE: 54385] [DI : complement] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



AE7B030009fl7_24407327_ciJ43 | [1536 | |5308 |[gg5~j | 23i || 892 | | 2.2e-8S 



Description 

pir: [LN:E69694] [AC : E696 94 : S3 9861 : S40073 ] [PN : ribosomal protein LI] [GN: rplA ] 
[CL: Escherichia coli ribosomal protein LI] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell82036:g2632370] [LN: BSUB0001] [AC : Z99104 : AL009126] [PN : ribosomal protein LI 
(BL1) ] [GN: rplA] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 1 of 21): from 1 to213080.] [SP:Q06797] [LE:119107] 
[RE:119805] [DI :direct] 



NT AA 

QRF Name NTID AAID — — , , score probabi lity 
LengthLength J - 



|A17503000987_244i405u_c2_80(5 | [1537 | [5303 ||T75T]|930 |p7T][uTTT 



Description 

gp: [GI:el296735:g3201550] [LN: SEY17116] [AC:Y17116] [PN: f ibrinogen-binding protein] 
[OR staphylococcus epidermidis] [DB :genpept-bctl] [DE : Staphylococcus epidermidis gene 
encoding f ibrinogen-bindingprotein, complete CDS.] [LE:38] [RE: 3316] [DI: direct] 



NT AA 

QRF Name NTID AAID — _ „ — , , score p robab ility 
LengthLength c *- 



IAI 7 50300098 7 2441587b cl 660 



[1538 | [5310 |pTJu~||99 ||129 ||1.7e-07 



Description 

gp: [GI:gl458327] [LN : CELF08F3 ] [AC:U64847] [GN:F08F3.4] [OR : Caenorhabditis elegans] 
[SR:Caenorhabditis elegans strain=Bristol N2] [DB : genpept - invl ] [DE : Caenorhabditis 
elegans cosmid F08F3.] [LE : 1515 : 1817 : 2044 : 2841] [RE : 1758 : 1994 : 2276 : 3110] 
[DI : direct Join] 

NT AA 

QRF Name NTID AAID — _ „. — , , score probability 
LengthLength c i - 



A17B03000Sfl7_2442217B_t2_402 | [1539 | [5311 |p2~ 1 1 43 | 



Description 
NO-HIT 



378 



ORF Name 



NT ID 



NT AA 
AAID — — A , score 
LenqthLength 



1AI7503000987 24429643 c2 793 



] [ 1540 | [5312 |p^— 1|73 | p 



probability 
| |0.011 



Description 

sp: [LN:YA28_PYRHO] [AC:058584] [GN:PHAL028] [OR : PYROCOCCUS HORIKOSHII] [DE : HYPOTHETICAL 
PROTEIN PHAL02 8] [SP:058584] [DB : swissprot] >pir: [LN:B71136] [AC:B71136 ] 
[PN: hypothetical protein PH0854] [GN:PH0854 ] [CL : hypothetical protein HI0719] 
[OR:Pyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030891 :g3257265] [LN: AP000003] 
[AC: AP000003 : AB009484 :AB009485 :AB009486 : AB009487 : AB009488 : AB009489] [PN: 137aa long 
hypothetical protein] [GN:PH0854] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB :genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 
544001-777000 nt . position (3/7 ). ] [NT:similar to Swiss_Prot : P37552 percent identity:] 
[LE: 218223] [RE: 218636] [DI : complement] 



ORF Name 



NTID 



_ _ _ NT AA 

AAID — -i — . . score 

Length Length 



probability 



IAI7503000987 24491037 ci 719 



J [1541 | [5313 ||495 1 |164 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID , — , _ — 



LengthLength 



score 



IAI7503000 987 24500387 c3 951 



] [1542 | [5314 |p^-||187 Im- 



probability 
0.0016 



Description 

sp: [LN:Y53_BPT3] [AC:P20327] [GN:5.3] [OR : BACTERIOPHAGE T3] [DE : HYPOTHETICAL GENE 5.3 
PROTEIN] [SP:P20327] [DB : swissprot] >pir: [LN:S07514] [AC:S07514 ] [PN:gene 5.3 protein] 
[GN:5.3 ] [CL:phage T7 gene 2.8 protein] [OR:phage T3] [DB:pir2] >gp : [GI :gl5705] 
[LN:POT3111G] [AC:X17255] [GN:5.3] [OR : Bacteriophage T3] [DB : genpept -phg] 
[DE: Bacteriophage T3 gene 1 to gene 11.] [SP:P20327] [LE:12333] [RE:12638] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



1A17503000987 245953 c3 881 



] [1543 | [5315 |p?~]|192 [piT" 



probability 
2.5e-19 



Description 

gp: [GI:gl314295] [LN: LMU40604] [AC:U40604] [FN:unknown] [OR:Listeria monocytogenes] 
[DB: genpept -bct2] [DE: Listeria monocytogenes ClpC ATPase (mec) gene, complete cds . ] 
[NT : ORF2 ; putative 19 kDa protein] [LE:672] [RE:1196] [DI:direct] 



ORF Name 



NTID AAID 



17503000987 24617130 c3 896 



1544 I F3TF 



NT AA 
LengthLength 



score 



TZTT 



probability 

b2bB | [Q.Q 



Description 

gp: [GI:el87583:gl495791] [LN: SARPOCGEN] [AC:X89233] [PN : DNA-directed RNA polymerase] 
[GN:rpoC] [OR staphylococcus aureus] [DB : genpept-bctl] [EC:2.7.7.6] [DE : S . aureus DNA 
for rpoC gene.] [NT : B ' subunit] [SP:P47770] [LE:<1] [RE:>3171] [DI: direct] 



379 



NT AA 

ORF Name NTID AAID — — score probability 
LengthLength ^ L 



|A17B0300098VJMb4a9JV_ciJJ24 | [1545 | [ 5317 |pT~| [7X3 [| 626 || 3.4e-61 

Description 

sp: [LN:UNG_BACSU] [AC:P39615] [GN : UNG : IPA- 57D] [OR:BACILLUS SUBTILIS] [EC:3.2.2.-] 
[DE : URACIL-DNA GLYCOSYLASE, (UDG) ] [SP:P39615] [DB : swissprot] >pir : [LN: S39712] 
[AC:S39712 :H69728 ] [PN : uracil -DNA glycosylase, ung] [GN:ung ] [CL : uracil -DNA 
glycosylase] [OR:Bacillus subtilis] [EC:3.2.2.-] [DB:pir2] >gp : [GI : g580875] [LN : BSGENR] 
[AC:X73124] [GN:ipa-57d] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis 
genomic region (325 to 333).] [SP:P39615] [LE:59206] [RE:59883] [DI:direct] 
>gp: [GI:ell86296:g2636332] [LN: BSUB0020] [AC : Z99123 : AL009126] [PN : uracil -DNA 
glycosylase] [GNrung] [OR:Bacillus subtilis] [DB : genpept - be t 1 ] [EC:3.2.2.-] 
[DE:Bacillus subtilis complete genome (section 20 of 21): from 3798401to 4010550.] 
[NT:alternate gene name: ipa-57d, ywdG] [SP:P39615] [LE:98079] [RE:98756] 
[DI complement] 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength 1 

|AI7B03000387JUfiB0332_c3_562 | [1546 | [ 5318 |p^~ 1 1 75 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — , — , . score probability 
LengthLength u 

|Ar;503000987_246S04g8J:3_522 | [1547 | [ 5319 |p53~| | 460 || 711 | | 3.4e-70 | 

Description 

gp: [GI:dl037645:g4126638] [LN : AB016282] [AC : AB016282] [PN : site-specif ic recombinase for 
integration and] [OR bacteriophage phi-105] [SR : bacteriophage phi-105 DNA] 
[DB: genpept -phg] [DE : Bacteriophage phi-105 DNA, complete sequence.] [NT :ORF3] 
[LE : 25528] [RE : 26952] [DI : complement] 



NT AA 

ORF Name NTID AAID — , — , , score probability 
LengthLength L ~ 



|M7B03000987_24656552_c2_772 | [1548 | [5320 jp3T3 — [ |49 || 149 | | 1.2e-10 

Description 

sp: [LN:RL33_THEMA] [AC:P35873] [GN : RPMG] [OR : THERMOTOGA MARITIMA] [DE:50S RIBOSOMAL 
PROTEIN L3 3] [SP:P35873] [DB : swissprot ] >gp : [GI:g407022] [LN : TMNUSGGE] [AC:Z11839] 
[PN:RIBOSOMAL PROTEIN L33] [OR : Thermo toga maritima] [DB : genpept -bet 1] [DE : T .maritima 
nusG gene and genes for ribosomal proteins.] [SP:P35873] [LE:579] [RE: 728] [DI:direct] 
>gp: [GI:g4980957] [LN : AE001723] [AC : AE001723 : AE000512] [PN : ribosomal protein L33] 
[GN:TM0451] [OR : Thermo toga maritima] [DB : genpept -bct2] [DE : Thermo toga maritima section 
35 of 136 of the complete genome.] [NT:similar to SP:P35873 GB:Z11839 PID:407022] 
[LE:5319] [RE:5468] [DIrdirect] 

' I ^ ^— |^ — ■ — nn. ■■ ■ nn - |m[mMrM|M , mm al ■. . ■Ml— — UU II U- . . I.II..II.III. n |. 

NT AA 

ORF Name NTID AAID — _„ — , , score probability 
LengthLength L - 



AI7503000987_246S7i92_r2_3BO | |1S49 | [5321 | p^~| | 50 | . 7e-15 



Description 

sp: [LN:ARSBJSTAAU] [AC:P30329] [GN : ARSB] [OR : STAPHYLOCOCCUS AUREUS] [DE : ARSENICAL PUMP 
MEMBRANE PROTEIN] [SP:P30329] [DB : swissprot] >pir : [LN : C41903 ] [AC:C41903 ] 
[PN: arsenical pump membrane protein] [GN : arsB ] [CL : arsenical pump membrane protein] 
[OR staphylococcus aureus] [DB:pirl] >gp : [GI : gl50728] [LN : PI2ARSRBC] [AC:M86824] 
[PN: arsenic efflux pump protein] [GN : arsB] [FN: arsenic efflux pump component (membrane] 
[OR:Plasmid pI258] [SR:Plasmid pI258 DNA] [DB : genpept-bctl] [DE:Plasmid pI258 arsenic 
resistance operon (arsRBC) genes, completecds . ] [LE:587] [RE: 1876] [DI: direct] 



380 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



F 



][ 



T^5TT 



[5322 ||851 | |2 86 ||£^3 [ Jl 



probability 
6.4e-44 



Description 

sp: [LN:YH17_SYNY3] [AC:P73846] [GN : SLR1717] [OR : SYNECHOCYSTIS SP] [SR:PCC 6803 J 

[DE: HYPOTHETICAL 30.2 KD PROTEIN SLR1717] [SP:P73846] [DB : swissprot] >pir : [LN : S75043] 

[AC:S75043 ] [PN : hypothetical protein slrl717] [OR: Synechocystis sp.] [SR:PCC 6803, , 

PCC 6803] [SR:PCC 6803, ] [DB:pir2] >gp : [GI : dl018638 : gl652 988] [LN:D90910] 
[AC:D90910:AB001339] [PN : hypothetical protein] [OR : Synechocystis sp.] [SR : Synechocystis 

sp. (strain:PCC6803) DNA] [DB :genpept-bctl] [DE : Synechocystis sp . PCC6803 complete 

genome, 12/27, 1430419-1576592.] [NT : ORF_ID : slrl717] [LE:37735] [RE:38541] [DI:direct] 



ORF Name 


NTID 


AAID 


NT AA , , . , . 
Le^thLe^th SCOre Probability 


AI7503000987_24823437_t3_52i 


1551 


5323 


1 495 164 ||7i ||0.044 — | 



gp: [GI:g46550] [LN:SACP221] [AC:X02166] [OR : Staphylococcus aureus] [DB : genpept-bctl] 
[DE: Staphylococcus plasmid pC221.] [NT:pot. reading- frame C (aa 1-90) (4555 is 2nd base 

in] [SP:P03866] [LE:4287] [RE:>4555] [DI:direct] >gp : [GI : el90060 : gl333818 ] [LN: SAPC221] 
[AC:X02529] [OR : Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus 

plasmid pC221 complete DNA sequence.] [NT:pot. orfB (aa 1-92) (4557 is 2nd base in 

COdon)] [SP:P03866] [LE:4289] [RE:>4557] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID — ^ Ut — _ 
LengthLength 



score 



probability 



1A17503000987 24554537 c2 804 



[1552 | [5324 | [300 | p4 || i.ie-37 | 

Description 

pir: [LN:C70070] [AC:C70070 ] [PN : conserved hypothetical protein ywtE] [GNrywtE ] 
[CL: hypothetical protein ywpj] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell84491:g2636110] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN : ywtE] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
19 of 21): from 3597091to 3809700.] [NT:similar to hypothetical proteins] [LE:97330] 
[RE:98190] [DI: direct] >gp : [GI : e3 08093 : gl894770] [LN : BSZ92954] [AC:Z92954] [GN:ywtE] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis yws [A, B, C, D, E , F, G] and gerBC 
genes.] [NT:product similar to Bacillus subtilis YxeH and YcsE] [LE:4292] [RE: 5152] 
[DI complement] >gp: [GI : el 1844 91 :g2636110] [LN : BSUB0019] [AC : Z99122 : AL009126] [GN: ywtE] 
[FN:unknown] [OR:Bacillus subtilis] [DB:genpept] [DE:Bacillus subtilis complete genome 
(section 19 of 21): from 3597091to 3809700.] [NT:similar to hypothetical proteins] 
[LE:97330] [RE:98190] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



IAI7503000987 2485532b c3 936 



Description 
NO-HIT 



] P 53 I | 5325 IP 5 ""!! 42 1 



ORF Name 



NT AA 

^ ^ Len^thLen^th ^^ probability 



1A17503000987 25358252 F5 622 



Description 
NO-HIT 



[1554 | [5326 [p2~ | |43 | 



ORF Name 



|A17!>03000987 25478801 t3 446 



Description 
NO-HIT 



NT 



AA 



^ ^ Le^thLe^th ^^ probability 
[5327 ||TT7 | 



381 



NT AA 

ORF Name NTID AAID Le ~ thL ength score probability 



A17503000S8V_2bb0^92_c3_972 | | 1SB6 | [5328 | pi - 1|7 . 4e-ll 



Description 

gp: [GI :e244846 :g2764870] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB : genpept -phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NTrgene 17.5] 
[LE: 11342] [RE: 11881] [DI:direct] 

NT AA 

ORF Name NTID AAID — , , — . score probability 
LengthLength 



A:C7S030009a7_2SB78fl27_c3_a57 | [1557 | [5329 [[251 | |86 ||334 Ip.Oe-SP" 



Description 

Sp: [LN : YBXF_STAAU] [AC:Q53602] [OR : STAPHYLOCOCCUS AUREUS] [DE : PROBABLE RIBOSOMAL 
PROTEIN IN RPSL 5'REGION (FRAGMENT)] [SP:Q53602] [DB : swissprot] >gp : [GI :g706920] 
[LN:SAU20869] [AC:U20869] [PN:unknown] [FNrunknown] [OR : Staphylococcus aureus] 
[DB : genpept -bct2] [DE : Staphylococcus aureus ribosomal protein S12 (rpsL) gene, 
completecds, ribosomal protein S7 (rpsG) and ORF 1 genes, partial cds . ] [NT: ORF 1] 
[LE:<1] [RE: 320] [DI:direct] 

NT AA 

ORF Name NTID AAID „ — , , — L1 score probability 
LengthLength L ~ 



AI7S03000987_2BS79g62J:3_S75 | [1558 | [5330 j|122r||40g | ||8 . Oe- 85 



Description 

pir: [LN:H69817] [AC:H69817 ] [PN : aminoacylase homolog yhaA] [GN:yhaA ] [CL :hippurate 
hydrolase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83009 :g2633343] [LN : BSUB0006] 
[AC:Z99109:AL009126] [GN:yhaA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to aminoacylase] [LE: 80123] [RE: 81313] [DI : complement] 

NT AA 

ORF Name NTID AAID , — , , — , score probability 
LengthLength 



AI7503000987_25660937_c3_390 | [1559 | [5331 ||498~||165 | p775~||3 ■ 5e-13 



Description 

sp: [LN:OPUC_BACSU] [AC:P46922] [GN:OPUAC] [OR:BACILLUS SUBTILIS] [DE : GLYCINE 
BETAINE- BINDING PROTEIN PRECURSOR] [SP:P46922] [DB : swissprot] >pir : [LN: 140537] 
[AC : 140537 :F69669 ] [PN:glycine betaine ABC transporter (glycine betaine -binding 
protein) opuAC precursor] [GN:opuAC ] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI :ell82252 :g2632586] [LN : BSUB0002] [AC : Z99105 : AL009126] [PN:glycine betaine ABC 
transporter (glycine] [GNropuAC] [FN:glycine betaine transport (osmoprotection) ] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis complete genome (section 
2 of 21): from 194651 to415810.] [SP:P46922] [LE:128023] [RE:128904] [DI:direct] 
>gp: [GI:dl009569:gl805372] [LN:D50453] [AC:D50453] [PN:glycine betain-binding protein 
precursor] [GN:opuAC] [OR: Bacillus subtilis] [SR: Bacillus subtilis (strain: 168 trpC2) 
DNA] [DB : genpept-bctl] [DE: Bacillus subtilis DNA for 25-36 degree region containing 
theamyE-srfA region, complete cds.] [LE:2326] [RE:3207] [DI:direct] >gp : [GI :g984805] 
[LN:BSU17292] [AC:U17292] [PN:glycine betaine -binding protein precursor] [GN:opuAC] 
[OR:Bacillus subtilis] [DB : genpept -bet 2] [DE:Bacillus subtilis ATPase (opuAA) , 
transmembrane protein (opuAB) andglycine betaine -binding protein precursor (opuAC) 
genes, completecds . ] [LE:2332] [RE: 3213] [DI: direct] 



382 



ORF Name 



NT 

NT ID AAID _ — 



AA 



Length Length 



score 



U7b03000987 2bb677B3 c2 773 



] [1560 | [5332 |p^~]|187 |p^~| 



probability 
|7.3e-91 



Description 

sp: [LN:NUSG_STACA] [AC:P36264] [GN:NUSG] [OR : STAPHYLOCOCCUS CARNOSUS] [DE : TRANSCRIPTION 
ANT I TERM I NAT I ON PROTEIN NUSG] [SP:P36264] [DB : swissprot] >pir: [LN:S38870] [AC:S38870 ] 
[PN: transcription antitermination factor nusG] [GN : nusG ] [CL : transcription 
antitermination factor nusG] [OR : Staphylococcus carnosus] [DB:pir2] >gp : [GI :g426473] 
[LN:SCSECE] [AC:X76134] [GN : nusG] [OR : Staphylococcus carnosus] [DB :genpept-bctl] 
[DE : S . carnosus secE, nusG and rplK genes.] [SP:P36264] [LE:331] [RE: 879] [DI: direct] 



ORF Name 



IM7503000987 25972086 EI 108 



NT ID 



AAID 



NT 



AA 



— Ll — _ score probability 
Length Length c *■ 



|5333 ||XT5 [ [44 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID _ — . , _ 



LengthLength 



score 



probability 



IAI7503000987 26173287 c3 970 



"] [1562 | [5334 |pT"~||i20 | pn~ ||9 . 0e-06 



Description 

gp: [GI:e244712:g2764863] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB:genpept-phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NTigene 16] 
[LE:9322] [RE: 9651] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID , 



Length Length 



score 



probability 



E 



17503000987 26206687 cl 642 



[1563 | [5335 |[210 ||69 ||278 |p~Se-24 | 

Description 

Sp : [LN: SECE_STAAU] [AC:O06442] [GN: SECE] [OR : STAPHYLOCOCCUS AUREUS] [DE : PRE PROTEIN 
TRANSLOCASE SECE SUBUNIT] [SP:O06442] [DB : swissprot ] >gp : [GI : g2 0783 76] [LN : SAU96619] 
[AC:U96619] [PN:SecE] [GN: secE] [OR : Staphylococcus aureus] [DB : genpept-bctl] 
[DE: Staphylococcus aureus NCTC 8325 SecE (secE) , NusG (nusG) and RplK(rplK) genes, 
complete cds . ] [LE:31] [RE: 213] [DI:direct] 



ORF Name 


NTID 


AAID 


NT AA 
„ — ^, „ — ^, score probability 
LengthLength c *- 


AI7503000987_262909i2_t3_547 


|15<54 


5336 


882 293 665 ||2.5e-65 | 



Description 

pir: [LN:H69759] [AC:H69759 ] [PN : conserved hypothetical protein yciA] [GN : yciA ] 
[OR:Bacillus subtilis] [DB:pir2] >gp: [GI : ell82286 :g2632620] [LN : BSUB0002] 
[AC:Z99105:AL009126] [GN:yciA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 2 of 21): from 194651 to415810.] 
[NT:similar to hypothetical proteins] [LE:169177] [RE:170094] [DI: direct] 
>gp: [GI:dl009603 :gl805406] [LN:D50453] [AC:D50453] [GN: yciA] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain:168 trpC2) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis 

DNA for 25-36 degree region containing theamyE-srfA region, complete cds.] [LE: 43480] 
[RE : 44397] [DI : direct] 



ORF Name 



NTID 



_ _ _ NT AA 

AAID , — , — L1 score 

LengthLength 



IAI7503000987 26306568 cl 724 



[5337 | proT" 



FT 



probability 
0.019 



Description 

gp: [GI:g4894306] [LN : AF065404 ] [AC : AF065404] [PN:pXOl-90] [OR:Bacillus anthracis] 
[DB:genpept-bct2] [DE: Bacillus anthracis virulence plasmid PX01, complete sequence.] 
[LE: 106772] [RE: 108730] [DI: direct] 



383 



NTID 



NT 

AAID . — 



AA 



Length Length 



score 



[5338 |[252 | pT 



probability 
|| 0.041 



ORF Name 

|A17b0a0009BV_^<>it>01^i>__c2_t>a4 
Description 

pir: [LN:C69161] [AC:C69161 ] [PN:sensory transduction histidine kinase] [GN : MTH468 ] 
[OR:Methanobacterium thermoautotrophicum] [DB:pir2] >gp: [GI :g2621537] [LN : AE000831] 
[AC:AE000831 :AE000666] [PN: sensory transduction histidine kinase] [GN:MTH468] 
[ORrMethanobacterium thermoautotrophicum] [DB : genpept-bctl] [DE : Methanobacterium 
thermoautotrophicum from bases 404817 to 415582 (section 37 of 148) of the complete 
genome.] [NT: Function Code: 12. 12 - Cell Processes, Broad] [LE:3329] [RE: 4993] 
[DI complement] 



ORF Name 



NTID AAID _ — . , . — 



^17503000987 26360327 cl 725 



NT AA 
LengthLength 
^57 



score 



T51~ 



FT1T 



probability 
2.0e-49 



Description 

pir: [LN:T00186] [AC:T00186 ] [PN:dUTP pyrophosphatase,] [OR : Staphylococcus aureus phage 
phi PVL] [EC:3.6.1.23] [DB:pir3] >gp : [GI : dl032887 :g3341960] [LN: AB009866] [AC : AB009866] 
[PN:dUTPase] [OR bacteriophage phi PVL] [SR bacteriophage phi PVL 

(specif ic_host staphylococcus aureus ATC] [DB :genpept-phg] [DE: Bacteriophage phi PVL 
proviral DNA, complete sequence.] [NT:orf 53] [LE:37579] [RE: 38106] [DI:direct] 



ORF Name 



^17503000987 26369082 c2 837 



NTID 



AAID 



NT 



AA 



[5340 



LengthLength 



— , score probability 



Description 
NO-HIT 



ORF Name 



NTID AAID 



IAI7503000987 25375952 cl 714 



NT AA 
LengthLength 
] 1 p41 ||627 | fZOT 



score probability 
223 ||1.7e-18 



Description 

gp: [GI:el254413:g2924242] [LN : BTP901456] [AC:Y14232] [PN : hypothetical protein] 
[OR bacteriophage TP901-1] [DB : genpept-phg] [DE : Bacteriophage TP901-1 ORFs 1-12.] 
[NT:0RF11] [LE:5545] [RE: 6168] [DI:direct] 



ORF Name 



NTID AAID — , — 



E 



17503000987 26569377 c2 810 



T57TT 



5T5T 



NT AA 
LengthLength 



score 



TTT 



probability 
|4.4e-61 



Description 

sp: [LN:HUMS_BACSU] [AC : P42405 : 031477] [GN: YCKG] [OR:BACILLUS SUBTILIS] [EC:4.1.2.-] 
[DE:3-HEXULOSE 6-PHOSPHATE FORMALDEHYDE LYASE)] [SP : P42405 :031477] [DB : swissprot] 
>pir: [LN:A69761] [AC:A69761 ] [PN : D-arabino 3-hexulose 6-phosphate formaldeh homolog 
yckG] [GN : yckG ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82298 :g2632632] 
[LN:BSUB0002] [AC : Z99105 : AL009126] [GN: yckG] [FN:unknown] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 2 of 21) : from 194651 
to415810.] [NT:similar to D-arabino 3-hexulose 6-phosphate] [SP:P42405] [LE:180087] 
[RE : 180719] [DI : complement] 



ORF Name 



NTID 



AAID 



IAI7503000987 26734625 ti 89 



1571 



5T£T 



NT AA 
LengthLength 
T5T 



score 



[2TTT 



probability 
4.1e-17 



Description 

gp: [GI:e308969:g2292761] [LN : BSFI2 1LYS] [AC:X95646] [PN:d-like repressor] [GN:orfl27] 
[OR: Streptococcus thermophilus bacteriophage Sfi21] [DB : genpept-phg] [DE : Streptococcus 
thermophilus bacteriophage Sfi21 DNA; lysogenymodule, 8141 bp.] [NT : putative] [LE:5693] 
[RE: 6076] [DI : complement] 



384 



NT AA 

ORFName NTID AA1V Le ~ thLe ~ th probability 



|Al7b030009fl7_26772^_tl_91 | [ 1572 | [5344 |p^] |167 | p5~ | |6 . le-12 

Description 

pir: [LN:A71608] [AC:A71608 ] [PN:probable integral membrane protein PFB0710c] 
[GN:PFB0710c ] [OR : Plasmodium falciparum] [DB:pir2] >gp : [GI :g3845257] [LN : AE001413] 
[AC:AE001413 :AE001362] [PN : predicted integral membrane protein] [GN: PFB0710c] 
[OR: Plasmodium falciparum] [SR:malaria parasite P. falciparum] [DB : genpept - inv2 ] 
[DE: Plasmodium falciparum chromosome 2, section 50 of 73 of thecomplete sequence.] 
[NTrpredicted by GlimmerM] [LE:4351] [RE:5562] [DI : complement] 



NT AA 

ORF Name NTID AAID — . — , score probability 
LengthLength L - 



Ai:7503000SS^25S3953a_ci_750 [ [1573 | [5345 [piT] | 437 | fTZTT f . 8e-146 



Description 

sp: [LN:YWFO_BACSU] [AC:P39651] [GN : YWFO : IPA-93D] [ORiBACILLUS SUBTILIS] 
[DE: HYPOTHETICAL 51.0 KD PROTEIN IN PTA 3 ' REGION] [SP:P39651] [DB : swissprot ] 
>pir : [LN:G70056] [AC : G70056 : S3 9748 ] [PN:ywfO protein : hypothetical protein ipa-93d] 
[GN:ywfO ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell86260 :g2636296] [LN:BSUB0020] 
[AC:Z99123 :AL009126] [GN:ywfO] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 20 of 21) : from 3798401to 4010550.] 
[NT:alternate gene name: ipa-93d; similar to] [SP:P39651] [LE:59934] [RE:61235] 
[DI: complement] >gp: [GI : e267329 :gl561567] [LN:BSUWFO] [AC:Z80355] [PN:Unknown] 
[GN:ywfO] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE :B . subtilis ywfO, ywgA and ywgB 
genes.] [SP:P39651] [LE:267] [RE:1568] [DI:direct] 



NT AA 

ORF Name NTID AAID — — , . score probabili ty 

LengthLength c jL 

p7503000987J>68S47b7_c3_886 | [1574 | [ 5346 || 560 ~| | 219 | p75 || 3.6e-9H 

Description 

sp: [LN:CYSE_STAXY] [AC:P77985] [GN : CYSE] [OR : STAPHYLOCOCCUS XYLOSUS] [EC : 2 . 3 . 1 . 3 0] 
[DE : SERINE ACETYLTRANSFERASE , (SAT)] [SP:P77985] [DB : swissprot] 

>gp: [GI:e261410:gl514656] [LN : SXCYSEREG] [AC:Y07614] [PN:serine O-acetyltransf erase] 
[GN: cysE] [OR staphylococcus xylosus] [DB : genpept-bctl] [EC : 2 . 3 . 1 . 30] [DE : S .xylosus 
cysE gene, genomic region.] [SP:P77985] [LE:707] [RE: 1357] [DI:direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



A17b03000987_27318_c2jm | [1575 | [5347 |piT] | 344 | [TT3F] |8 . 3e-122 



Description 

sp: [LN:ADHl_ZYMMO] [AC:P20368] [GN: ADHA] [OR : ZYMOMONAS MOBILIS] [EC : 1 . 1.1.1] 
[DE: ALCOHOL DEHYDROGENASE I, (ADHI)] [SP:P20368] [DB : swissprot] >pir : [LN: A35260] 
[AC:A35260:E40649:A24801 ] [PN:alcohol dehydrogenase, 1] [GN:adhA ] [CL:alcohol 
dehydrogenase : long- chain alcohol dehydrogenase homology] [OR : Zymomonas mobilis] 
[EC:1. 1.1.1] [DB:pirl] >gp : [GI : gl55571] [LN : ZMOADHA] [AC:M32100] [OR : Zymomonas mobil is] 
[SR:Z. mobilis (strain CP4) DNA] [DB : genpept-bctl] [DE : Z .mobilis alcohol dehydrogenase I 
(adhA) gene, complete cds . ] [NT:alcohol dehydrogenase I (adhA) (EC 1.1.1.1)] [LE:218] 
[RE: 1231] [DI: direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



p7503000987_27659(W:2jm | [1576 | [ 5348 ||37S ~| | 124 || 273 |p.0e-24 | 

Description 

pir: [LN:H70070] [AC:H70070 ] [PN : hypothetical protein ywzC] [GN:ywzC ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell86261:g2 636297] [LN:BSUB0020] [AC : Z99123 : AL009126] 

[GN:ywzC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 20 of 21): from 3798401to 4010550.] [LE:61397] [RE:61621] 

[DI complement] 



385 



ORF Name 

| A17bOa0009B7_aB47B«V_ta_351 



NT AA 
NT ID AAID — • _ — , . score 
LengthLength 



probability 



] [1577 | [5349 | |576 | [191 ||7 97 || 2.^e-7 5- 



Description 

sp: [LN:ARSB_STAAU] [AC:P30329] [GNrARSB] [OR : STAPHYLOCOCCUS AUREUS] [DE : ARSENICAL PUMP 
MEMBRANE PROTEIN] [SP:P30329] [DB : swissprot] >pir: [LN:C41903] [AC:C41903 ] 
[PN: arsenical pump membrane protein] [GN: arsB ] [CL : arsenical pump membrane protein] 
[OR: Staphylococcus aureus] [DBrpirl] >gp : [GI : gl50728] [LN: PI2ARSRBC] [AC:M86824] 
[PN: arsenic efflux pump protein] [GN: arsB] [FN: arsenic efflux pump component (membrane] 
[OR:Plasmid pI258] [SR:Plasmid pI258 DNA] [DB : genpept-bctl] [DE:Plasmid pI258 arsenic 
resistance operon (arsRBC) genes, completecds . ] [LE:587] [RE: 1876] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



VI7503000987 29320127 £3 442 



Description 
NO-HIT 

ORF Name 



^ LengthLength probability 
j |5350 ||135 | |44 | 



NTID 



NT 



AA 



AAID — . 



LengthLength 



score probability 



|A17b03000987__29694425_tl_142 



] [1579 | [5351 |p2~ ||283 ||?I8~|p ■ 8e-39 



Description 

gp: [GI:gl502421] [LN : BSU59433 ] [AC:U59433] [PN: 3 -ketoacyl -acyl carrier protein 
reductase] [GN:fabG] [OR:Bacillus subtilis] [DB : genpept-bct2] [DE:Bacillus subtilis 
PlsX (plsX) , malonyl-CoA:Acyl carrier proteintransacylase (fabD) and 3 -ketoacyl -acyl 
carrier protein reductase (fabG) genes, complete cds, and acyl carrier protein (acpP) 
gene, partial cds.] [NT: also called 3-oxoacyl-acyl carrier protein] [LE:1813] [RE: 2553] 
[DI: direct] 



ORF Name 



[A17503000987 30178137 cl 706 



Description 
NO-HIT 



NTID 



„ TO NT AA 

AAID — — , score 

LengthLength 



probability 



|5352 ip^TS | 



FT 



ORF Name 



IAI7503000987 30267937 t3 532 



Description 
NO-HIT 



ORF Name 



1AI7503000987 30742332 c2 841 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



2 [1581 | [5353 ||T?1~||40 | 



_ _ _ _ _ NT AA 

NTID AAID — , — , score 

LengthLength 



probability 



] [1582 | |bib4 ||228 -||75 | [Vg ||0.012 



] 



Description 

pir: [LN:D71169] [AC:D71169 ] [PN : hypothetical protein PH0552] [GN:PH0552 ] 
[OR:Pyrococcus horikoshii] [DB:pir2] >gp : [GI :dl030584 :g3256958] [LN : AP000002] 
[AC:AP000002 :AB0094 75 :AB009476 :AB0094 77 :AB0094 78 :AB0094 79 :AB009480] [PN: 163aa long 

hypothetical protein] [GN:PH0552] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain :OT3) DNA] [DB : genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 

287001-544000 nt . position (2/7) . ] [NT:similar to PIR:F64411 percent identity : 33 . 898 in] 
[LE:205880] [RE:206371] [DI:direct] 



386 



ORF Name 



NT 



AA 



^ MID Le - thLe - th score probability 



17503000987 Jl2b6916 c2 774 



] [ 1583 | [ 5355 |p7~||168 | pr~|[ b . ie-67 



Description 

pir: [LN:S38871] [AC:S38871 ] [PN : ribosomal protein Lll] [GN : rplK ] [CL : Escherichia coli 
ribosomal protein Lll] [OR : Staphylococcus carnosus] [DB:pir2] >gp : [GI :g581638] 
[LN:SCSECE] [AC:X76134] [PN:L11 protein] [GN: rplK] [OR : Staphylococcus carnosus] 
[DB:genpept-bctl] [DE : S . carnosus secE, nusG and rplK genes.] [SP:P36254] [LE:1061] 
[RE: 1483] [DI: direct] 



ORF Name 



NT 



AA 



^ AAIP L en^thLen^th g^ probability 



[AI7503000987 31334838 c3 553 



] [1584 | [535S |[gT^~||i72 | |ggg— ||i . 7e-B2 



Description 

pir: [LN.-E70057] [AC:E70057 ] [PN : hypothetical protein ywhD] [GN: ywhD ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : e267625 : gl565239] [LN : BSTHRZ] [AC:Z80360] [PN:Unknown] 
[GN:ywhD] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis thrZ downstream 
chromosomal region.] [LE:3358] [RE:3876] [DI:direct] >gp : [GI : ell86252 :g2636288] 
[LN:BSUB0020] [AC : Z99123 : AL009126 ] [GN:ywhD] [FN:unknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 20 of 21): from 
3798401to 4010550.] [LE:52838] [RE:53356] [DI : complement] 



ORF Name 



|AI7b03000987_3I34386J:3_6i6 



Description 
NO-HIT 



NT ID 



NT 



AA 



AAID 

LengthLength 

[1585 | [5357 | pB~| | 75 | 



score 



probability 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



1AI7503000987 31353377 c2 785 



"J [1585 | [5358 ||T73~| | 1S7 | |703 | |2.4e-69 



Description 

gp: [GI :dl039003 :g4512400] [LN : AB017508] [AC : AB017508] [GN: rpsG] [OR:Bacillus 
halodurans] [SR:Bacillus halodurans (strain :C-125) DNA] [DB :genpept-bctl] [DE:Bacillus 
halodurans C-125 genomic DNA, 32 kb fragment, completecds . ] [NT : rpsG homologue 
(identity of 84% to B. subtilis )] [LE:8367] [RE: 8837] [DI: direct] 



ORF Name 



IAI7503000987 31552 t2 326 



Description 
NO-HIT 



NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



] [1587 | [5359 |pi~ 1 140 | 



ORF Name 



NTID 



NT AA 
AAID — _ — , . score 
LengthLength 



probability 



LAI7503000987 32031437 cl 736 



][ 



[5360 |puT-]|901 |[TT5T]|4.3e-ii8 



Description 

sp: [LN:VG12_BPPZA] [AC:P07537] [GN:12] [OR : BACTERIOPHAGE PZA] [DE: PRE -NECK APPENDAGE 
PROTEIN (LATE PROTEIN GP12 ) ] [SP:P07537] [DB : swissprot] >pir : [LN : WMBP12 ] [AC:G24831 ] 
[PN:gene 12 protein] [GN:12 ] [CL: phage PZA gene 12 protein] [OR: phage PZA] [DB:pirl] 
>gp: [GI:g216061] [LN:PZACG] [AC :M11813 :M13904 :M13905] [PN:pre-neck appendage protein] 
[OR bacteriophage PZA] [SR : Bacteriophage PZA DNA] [DB : genpept -phg] [DE : Bacteriophage 
PZA (from B. subtilis) , complete genome.] [LE:11428] [RE:13992] [DI:direct] 



387 



ORF Name 



LL750300098 7 322S1 tl 9B 



Description 
NO-HIT 



NTID 



NT AA 
AAID — _ — , -i scor e 
LengthLength 



probability 



] [1559 | [5361 |p9-|p | 



ORF Name 

|AI7503000987_3231b907_t3_595 

Description 

NO-HIT 



ORF Name 



117503000987 32595152 cl 739 



Description 
NO-HIT 



NTID 



T5W 



n7vTT% NT AA 

AAID — _ — u , score 

LengthLength 



probability 



NT AA 
NTID AAID — _ — _ score 
LengthLength 



probability 



] [1591 | [5353 |pn~||133 | 



ORF Name 



IAI7503000987 3306563 cl 688 



Description 
NO-HIT 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



-| [1592 | |53S4 |pT" 



ORF Name 



NT 



AA 



NTID AAID — , T — , 
LengthLength 



IAI7503000987 33235050 r2 235 



|5355 [pTT I [T^3~ 



score probability 

m — 110.0014 



Description 

pir: [LN:D70063] [AC:D70063 ] [PN : hypothetical protein ywnA] [GN: ywnA ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell84569 :g2636188] [LN: BSUB0019] [AC : Z99122 : AL009126] 
[GN : ywnA] [FNrunknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 19 of 21) : from 3597091to 3809700.] [LE: 168142] [RE: 168543] 
[DI: complement] >gp : [GI : e269486 :gl 592697] [LNiBSUEROP] [AC:Y08559] [PN:Unknown] 
[GNrywnA] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis urease operon and 
downstream DNA. ] [LE:2688] [RE:3089] [DI: direct] >gp : [GI : ell84569 :g2636188] 
[LN:BSUB0019] [AC : Z 9 912 2 : AL009126] [GN : ywnA] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept] [DE: Bacillus subtilis complete genome (section 19 of 21) : from 3597091to 
3809700.] [LE: 168142] [RE: 168543] [D I : complement] 



ORF Name 



NTID 



NT 

AAID 



AA 



LengthLength 



score 



probability 



F 



17503000987 33394062 tl 179 



Description 
NO-HIT 



J [1594 | [5366 |p^~ | 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000987 33414693 c2 820 



] [1595 | [5367 ||TuW| |362 |pTT 



probability 
|6.9e-16 



Description 

sp : [LN : KIME_METJA] [AC:Q58487] [GN:MJ1087] [OR : METHANOCOCCUS JANNASCHII] [EC : 2 . 7 . 1 . 36] 
[DE:MEVALONATE KINASE, (MK) ] [SP:Q58487] [DB : swissprot] >pir : [LN : F6443 5] [AC:F64435 ] 
[PN:mevalonate kinase,] [OR:Methanococcus jannaschii] [EC : 2 . 7 . 1 . 36] [DB:pir2] 
[MP:FOR1026197-1027135 ] >gp : [GI :gl591731] [LN:U67551] [AC : U67551 : L77117] 
[PN:mevalonate kinase] [GN:MJ1087] [OR : Methanococcus jannaschii] [DB : genpept -bct2 ] 
[DE:Methanococcus jannaschii section 93 of 150 of the complete genome.] [NT: similar to 
PID:1184118 SP:Q50559 GB:AE000666] [LE:5625] [RE:6563] [DIrdirect] 



388 



ORF Name 



NT AA 
NT ID AAID — T — , . score 
LengthLength 



probability 



F 



I7S03000987 33(^00035 ci 553 



] [1596 | [5358 |pTTF] |705 |p27|[0.0 



Description 

gp: [GI:el422302 :g4582216] [LN : SAU237696] [AC : AJ237696] [PN : elongation factor G (EF-G) ] 
[GNrfus] [FN: translation elongation factor] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE : Staphylococcus aureus fus gene.] [LE:83] [RE: 2164] [DI: direct] 



ORF Name 



NTID 



„ TT ^ NT AA 

AAID — — score 

LengthLength 



probability 



IAI7S03000S87 337S6b03 c2 830 



] [1597 | [5359 |pn~||50 |pg5 ||6.2e-2T 



Description 

gp: [GI:el285113 :g3005826] [LN : BPA2INT] [AC:Y12813] [PN : hypothetical protein] [GN: orfB] 
[OR Lactobacillus casei bacteriophage A2] [DB : genpept -phg] [DE : Bacteriophage A2 rep, 
xis and int genes.] [LE:852] [RE: 1097] [DI : complement] 



ORF Name 



NTID 



„ T „ NT AA 

AAID — , « _ — , , score 

LengthLength 



probability 



AI7503000987_3378625I_c2_865 

Description 

NO-HIT 



1S98 | [5370 ||275~"||81 | 



NT AA 

ORF Name NTID AAID — — , , sco re probability 
LengthLength c *- 



A17503000987_338g9i93_cl_726 | [1599 | [5371 ||IBX~]|50 | [150 ||1.2e-08 



Description 

sp : [LN : RINB_BPPHA] [AC:Q03183] [GN : RINB] [OR : BACTERIOPHAGE PHI-11] [DE : TRANSCRIPTIONAL 
ACTIVATOR RINB] [SP:Q03183] [DB : swissprot] >pir : [LN: A49703] [AC:A49703 ] [PN:intgene 
activator RinB] [OR: phage phi -11] [DB:pir2] >gp : [GI : gl66161] [LN : BPHRINAB] [AC:L07580] 

[PN: Bacteriophage phi-ll int gene activator] [GN:rinB] [FN:Activate bacteriophage 
phi- 11 int gene] [OR : Staphylococcus bacteriophage phi 11] [SR : Bacteriophage phi-ll DNA] 

[DB: genpept -phg] [DE : Bacteriophage phi-ll rinA and rin B genes, required for 
theactivation of Staphylococcal phage phi-ll int expression.] [LE:60] [RE: 248] 

[DI: direct] 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength c 

A17S03000987_3402B066_c2_8S6 | [1500 | [537 2 || 642 ~| [213 |p || 8.2e-0V 



Description 

gp: [GI:e247172 :gl926370] [LN : LBPHIG1E] [AC:X98106] [GN:Rorf232] [OR : Bacteriophage 

phigle] [DB : genpept -phg] [DE : Lactobacillus bacteriophage phigle complete genomic DNA.] 
[LE : 4579] [RE : 5277] [DI : complement] 



389 



NT AA 

ORFName NTID AAID L^thLe^th^e probability 



AI7503000987 34242^0^ C3 956 



[1501 | [5373 j|270 ~| |59 | |V 8 ||0.017 



Description 

sp: [LN:VP8_VARV] [AC:P33039] [GN:L4R:M4R] [OR: VARIOLA VIRUS] [DE : STRUCTURAL PROTEIN VP8 

PRECURSOR (25 KD MAJOR CORE PROTEIN) (P25K) ] [SP:P33039] [DB : swissprot] 
>pir : [LN:A36845] [AC : A3 6 84 5 : S3 3 090 ] [PN:M4R protein : structural protein VP8] 
[CL: vaccinia virus 28K protein] [OR:variola virus] [DB:pir2] >gp : [GI:g262435] 
[LN:S55844] [AC:S55844] [PN : glycoprotein VP8] [GN: L4R] [OR:Variola major virus] 
[SR:Variola major virus India-1967] [DB : genpept - vrl ] [DE : G9R . . . H7R [variola major 

virus, India-1967, Genomic, 19 genes, 18029 nt] . ] [NT : This sequence comes from Fig. 2.] 
[LE:3187] [RE:3942] [DI: direct] >gp : [GI :g438994] [LN : VARCG] [AC:L22579] [OR:Variola 

major virus] [SR:Variola major virus (strain Bangladesh-1975) DNA] [DB : genpept-vrl] 
[DE:Variola major virus (strain Bangladesh-1975) complete genome.] [NT:homolog of 

vaccinia virus CDS L4R (structural] [LE:70583] [RE:71338] [DI:direct] >gp: [GI:g297256] 
[LN: WCGAA] [AC:X69198] [GN: M4R] [OR:Variola virus] [DB :genpept-vrl] [DE:Variola virus 

DNA complete genome.] [SP:P33039] [LE:69950] [RE:70705] [DI:direct] >gp : [GI : g62352] 
[LN : WHINDQKH] [AC:X67119] [PN: structural protein VP8] [GN : L4R COP] [0R:Variola virus] 
[DB:genpept-vrl] [DE: Variola virus (Hindlll-Q, K, H, M, L, I , F genome fragment) genes.] 
[SP:P33039] [LE:19835] [RE:20590] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — , , score probabilit y 
Length Length • L - 



|M75030009fl7_34242202_ti_15g | [1602 | [5374 |pTu~| || 0.047 



Description 

sp: [LN: Y076_MYCPN] [AC:P75555] [OR : MYCOPLASMA PNEUMONIAE] [DE : HYPOTHETICAL PROTEIN 
MG076 HOMOLOG] [SP:P75555] [DB : swissprot] >pir : [LN : S73943 ] [AC:S73943 ] [PN:MG076 
homolog G07_orfl38] [OR Mycoplasma pneumoniae] [SR:ATCC 29342, , ATCC 29342] [SR:ATCC 
29342, ] [DB:pir2] >gp : [GI : gl67432 0] [ LN : MPAE 000059] [AC : AE000059 :U00089] 
[GN:G07_orf 138] [OR : Mycoplasma pneumoniae] [DB :genpept-bct2] [DE : Mycoplasma pneumoniae 
section 59 of 63 of the complete genome.] [NT:MG076 homolog, from M. genitalium] 
[LE:168] [RE: 584] [DI:direct] 

NT AA 

ORF Name NTID AAID — _„ — , , score probability 
" — LengthLength — — c * 



AI750300C)9a7_34407053_t3_433 | [1603 | [ 5375 j |7ff5~| p60 ||5gg~|| i . 3e-S4 



Description 

pir: [LN:S76797] [AC:S76797 ] [PN : hypothetical protein] [CL : conserved hypothetical 
protein MJ0165: phosphoribosylaminoimidazole carboxylase catalytic chain homology] 

[OR:Synechocystis sp.] [SR:PCC 6803, , PCC 6803] [SR:PCC 6803, ] [DB:pir2] 

>gp: [GI:dl019442:gl653798] [LN:D90916] [AC : D90916 : AB00133 9] [PN : hypothetical protein] 

[OR:Synechocystis sp.] [SR : Synechocystis sp. (strain : PCC6803 ) DNA] [DB : genpept-bctl] 

[DE:Synechocystis sp. PCC6803 complete genome, 26/27, 3270710-3418851.] 

[NT : 0RF_ID : S111489] [LE : 107538] [RE : 108377] [DI : complement] 

NT AA 

ORF Name NTID AAID — - — , . score probability 
LengthLength c i - 



A175030009{}V_34609703_c3 M 963 | [ 1604 | [5375 |p3~| | 6 5 | pTu~[| l . 6e-05 



Description 

pir: [LN:T00185] [AC:T00185 ] [PN : hypothetical protein 52] [OR : Staphylococcus aureus 
phage phi PVL] [DB:pir3] >gp : [GI :dl032886 :g3341959] [LN: AB009866] [AC : AB009866] 
[OR bacteriophage phi PVL] [SR : bacteriophage phi PVL (specif ic_host : Staphylococcus 
aureus ATC] [DB :genpept-phg] [DE : Bacteriophage phi PVL proviral DNA, complete 
sequence.] [NT:orf 52] [LE: 37356] [RE: 37604] [DI: direct] 



390 



NT AA 

0RF Name NTID AAID T —. hT , — h score probability 



Length Length 1 

A17503000987_346blbbb_cl_734 | [1505 | [ 5377 | p^T] [1040 || 267 || 2.8e-24 



Description 

gp: [GI:e244718:g2764872] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB :genpept-phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NT: gene 18] 
[LE:12264] [RE:13373] [DI:direct] 

NT AA 

ORF Name NTID AAID — — ^ score probability 
LengthLength ^ JL 



AI7g03000987_35ig2800_c3_88g | [1606 | [ 5378 | p3uT]| 50i | [ 1701 || 4 . 2e-175 



Description 

sp: [LN:SYE_BACSU] [AC:P22250] [GNrGLTX] [OR : BACILLUS SUBTILIS] [EC : 6 . 1 . 1 . 17] 
[DE : (GLURS) ] [SP:P22250] [DB : swissprot] >pir : [LN : SYBSET] [AC : A36090 : S66121 : E69635 ] 
[PN:glutamate--tRNA ligase, gltX : glutamyl -tRNA synthetase gltX] [GNrgltX ] 
[CL:glutamate--tRNA 1 igase : glutamine - - tRNA ligase homology] [OR: Bacillus subtilis] 
[EC:6. 1.1.17] [DB:pirl] >gp : [GI : dl005868 : g467480] [LN:BAC180K] [AC:D26185] 
[PN: glutamyl -tRNA synthetase] [GN:gltX] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(sub_species Warburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . subtilis DNA, 180 
kilobase region of replication origin.] [LE:174646] [RE:176097] [DI:direct] 
>gp: [GI:gl43006] [ LN : BACGLTXA] [AC : M55073 : J0564 7] [PN : transfer RNA-Gln synthetase] 
[GN:gltX] [OR:Bacillus subtilis] [SR:Bacillus subtilis 168T DNA] [DB : genpept-bctl] 
[EC: 6. 1.1.17] [DE: Bacillus subtilis glutamyl -tRNA synthetase gene, complete cds . ] 
[LE:1] [RE:1452] [DI:direct] >gp : [GI :g289282] [LN: BACGLUSYN] [AC:L14580] 
[PN: glutamyl -tRNA synthetase] [GN:gltX] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
(strain 168T) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis glutamyl-tRNA transferase 
(gltX) , serineacetyltransferase (cysE) , and cysteinyl-tRNA synthetase (cysS)genes, 
complete cds ' s . ] [LE:1706] [RE:3157] [DI:direct] >gp : [GI : ell82025 :g2632359] 
[LN:BSUB0001] [AC : Z99104 : AL009126] [PN: glutamyl- tRNA synthetase] [GN:gltX] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [EC : 6 . 1 . 1 . 17] [DE:Bacillus subtilis complete genome 
(section 1 of 21): from 1 to213080.] [SP:P22250] [LE:111044] [RE:112495] [DI:direct] 
>gp: [GI:g2653659] [LN:U49789] [AC:U49789] [PN: glutamyl -tRNA synthetase] [OR:Bacillus 
subtilis] [DB:genpept-bct2] [DE:Bacillus subtilis glutamyl-tRNA synthetase gene, 
complete cds.] [LE:1] [RE: 1452] [DI: direct] 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength - 

A17S030003S7_363S0062J:5_3i7 | |1507 | [ 5379 |46 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength c 

|AI7S030009tr;_3S44Bfl7S_cl_72B | [1508 | [ 5380 ||T55~| | 52 |[ 53 || 0.016 



Description 

pir: [LN:C70306] [AC:C70306 ] [PN: conserved hypothetical protein aq_064c] [GN:aq_064c ] 
[CL: conserved hypothetical secreted protein HP0320] [OR:Aquifex aeolicus] [DB:pir2] 
>gp: [GI:g2982828] [LN : AE0006 72 ] [AC : AE000672 : AE000657] [PN : hypothetical protein] 
[GN:aq_064c] [OR:Aquifex aeolicus] [DB : genpept-bct2] [DE:Aquifex aeolicus section 4 of 
109 of the complete genome.] [LE:2190] [RE: 2423] [DI: direct] 
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ORF Name 



NT 



AA 



NTID AAID . — . , , — 



Length Length 



score probability 



F 



17503000987 360b4813 c2 833 



] [1509 | [ 5381 | |3fl7 -|| 128 | [75 1| 0.034 



Description 

pir: [LN:S41186] [AC : S43812 : S41186 ] [PNigene 39 protein] [ORrphage SPP1] [DB:pir2] 
>gp: [GI:e244746:g2764918] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB:genpept-phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NT:gene 39] 
[LE:36342] [RE:36722] [DIrdirect] >gp : [GI :g439643] [LN:BSSPP1] [AC:X67865] [GN:39] 
[OR: Bacteriophage SPP1] [DB :genpept-phg] [DE : B . subtilis phage SPP1 DNA sequence coding 
for products required f orreplication initiation.] [LE:6073] [RE: 6453] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID — , , 

LengthLength 



score probability 



IAI7503000987 36134715 c3 887 



] [1610 | [5382 ||m~]|256 | ||2 . Oe-65 



Description 

sp: [LN:YACO_BACSU] [AC:Q06753] [GN : YACO] [OR:BACILLUS SUBTILIS] [EC:2.1.1.-] 
[DE: HYPOTHETICAL TRNA/RRNA METHYLTRANSFERASE YACO,] [SP:Q06753] [DB : swissprot] 
>pir: [LN:S66124] [AC : S66124 : G6 9741 : 140000 ] [PN : conserved hypothetical protein yacO] 
[GN:yacO ] [CL : conserved hypothetical protein HI0860] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl005871:g467483] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . 
subtilis DNA, 180 kilobase region of replication origin.] [LE: 178868] [RE: 179617] 
[DI:direct] >gp: [GI : ell82029 :g2632363] [LN : BSUB0001] [AC : Z99104 : AL009126] [GN:yacO] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 1 of 21): from 1 to213080.] [NT:similar to hypothetical proteins] 
[SP:Q06753] [LE:115266] [RE:116015] [DI:direct] 



ORF Name 



NTID 



_ _ — — ^ NT AA 
AAID — — , , score 
LengthLength 



U7503000987 36228563 c3 960 



probability 
] | |pT"]p32 ||110 H0.00063 



] 



Description 

gp: [GI:g2865254] [LN: AF00823 7] [AC : AF008237 : L28928 : L2892 9 : L28930 : L2893 1] [PN: unknown] 
[OR: Mitochondrion Chlamydomonas eugametos] [SR : Chlamydomonas eugametos] 
[DB:genpept-pln2] [DE : Chlamydomonas eugametos mitochondrion, complete genome.] 
[NT:orf306; intronic ORF; formerly i3; in frame with] [LE:<8260] [RE:9180] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — . — , . score 
LengthLength 



probability 



IAI7503000987 36521067 13 540 



Description 
NO-HIT 



] [1612 | [ 5384 |p^~ | p | 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



^7503000987 3508462 ci 703 



TFTT 



Description 
NO-HIT 



ORF Name 



AI7503000987_3409643_r2_217 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



[5386 |pnT7^[ |TW 



[7T9~ 



probability 
14.8e-71 



Description 

gp: [GI:g4981502] [LN : AE001759] [AC : AE001759 : AE000512 ] [PN : conserved hypothetical 
protein] [GN:TM0964] [OR : Thermo toga maritima] [DB : genpept -bet 2 ] [DE : Thermotoga maritima 
section 71 of 136 of the complete genome.] [NT: similar to GP: 2622211 percent identity: 
60.38;] [LE:5733] [RE:6941] [DI:direct] 
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ORF Name 



NT AA 
NTID AAID „ — — score 
Length Length 



probability 



[1615 | [5387 ||1500~||499 | pn~ ||1 . 5e-28 



|Al7!D03000987_J91^b03_clJ>98 

Description 

pir : [LN:E69825] [AC:E69825 ] [PN:amino acid transporter homolog yhdG] [GN:yhdG ] 
[CLrarginine permease] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82947 :g2633281] 
[LN:BSUB0006] [AC : Z99109 : AL009126] [GNryhdG] [FN:unknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DErBacillus subtilis complete genome (section 6 of 21) : from 999501 
tol209940.] [NT:similar to amino acid transporter] [LE:23330] [RE:24727] [DI:direct] 
>gp: [GI:e325026:g2226202] [LN: BSY14082] [AC:Y14082] [PN : hypothetical protein] [GN:yhdG] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis chromosomal DNA, region 
72 to 75 degrees: spoVRto sspB.] [NT : Similarity to human retroviral receptor (PIR] 
[LE:9978] [RE:11375] [DI:direct] 



ORF Name 



NTID AAID — _ — , score 
Length Length 



probability 



[AI7503000987 3945333 c2 782 



][ 



Description 
NO-HIT 



[5388 ||T55 | 



[HIT 



ORF Name 



NT AA 
NTID AAID — ._ — , , score 
Length Length 



probability 



F 



17503000987 3961S90 c3 922 



] | lg17 | | 5389 |P^^|p57 ||669 ||9.5e-65 



Description 

pir: [LN:H69831] [AC:H69831 ] [PN : acetyl -CoA C-acetyltransf erase homolog yhfS] [GN:yhfS 
] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83037 : g2633371] [LN: BSUB0006] 
[AC:Z99109:AL009126] [GN:yhfS] [FNrunknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to acetyl-CoA C-acetyltransf erase] [LE: 109360] [RE: 110454] [DI : complement] 
>gp: [GI:e324998 :g2226255] [LN : BSY14 084] [AC.-Y14084] [PN : hypothetical protein] [GN:yhfS] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis chromosomal DNA, region 78-80 
degrees: aprE to comK.] [NT : similarity to acetyl -CoAuacetyltransf erases (ThiL,] 
[ LE : 3 7 8 8 ] [RE : 4 8 8 2 ] [DI : compl ement ] 



ORF Name 



NTID 



____ NT AA 

AAID — _ — . score 

Length Length 



probability 



|&:£7S03000987_3963B_i2_3i6 



] |lg!8 | [5390 ||124r[|4i4 | pT^~||7 . 4e-98 



Description 

sp: [LN:YKGC_ECOLI] [AC:P77212] [GN : YKGC] [OR : ESCHERICHIA COLI] [DE : INTERGENIC REGION] 
[SP:P77212] [DB : swissprot ] >pir : [LN:H64756] [AC:H64756 ] [PN:probable mercury(II) 
reductase, :ykgC protein] [GN:ykgC ] [OR: Escherichia coli] [EC : 1 . 16 . 1 . 1] [DB:pir2] 
>gp: [GI:gl657503] [LN : ECU73857] [AC:U73857] [OR : Escherichia coli] [DB : genpept-bctl] 
[DE Escherichia coli chromosome minutes 6-8.] [NT: similar to S. aureus mercury (II) 
reductase] [LE:26473] [RE:27825] [DI : complement] >gp : [GI :gl786495] [LN: AE000137] 
[AC:AE000137 :U00096] [PN:putative oxidoreductase] [GN:ykgC] [FN:putative enzyme; Not 
classified] [OR : Escherichia coli] [DB : genpept -bct2 ] [DE : Escherichia coli K-12 MG1655 
section 27 of 400 of the compl etegenome . ] [NT:f450; 35 pet identical (29 gaps) to 430 
residues of] [LE:6292] [RE: 7644] [DI : complement] 
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ORF Name 



LL7503000987 4031952 c2 775 



NTID 



NT 



AA 



AAID _ . , , 



LengthLength 



score 



probability 



[1519 | [5391 |pn~]|15g ||g¥5"~||1.3e-B2 



Description 

pir: [LN:D69695] [AC:D69695 ] [PN : ribosomal protein L10 (BL5) rplJ] [GNirplJ ] 
[CL: Escherichia coli ribosomal protein L10] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl009473:g786163] [LN : BACRPL2] [AC:D50303] [PN : Ribosomal Protein L10] [GN:rplJ] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain: 168) DNA] [DB :genpept-bctl] 
[DE: Bacillus subtilis genes for ribosomal proteins LI, L10 and L12, partial and complete 
cds.] [LE:347] [RE:847] [DI: direct] >gp : [GI : ell82037 :g2632371] [LN : BSUB0001] 
[AC:Z99104:AL009126] [PN : ribosomal protein L10 (BL5) ] [GNirplJ] [OR:Bacillus subtilis] 
[DBigenpept-bctl] [DEiBacillus subtilis complete genome (section 1 of 21) : from 1 
to213080.] [SP:P42923] [LE:120057] [RE:120557] [DIrdirect] 



ORF Name 



IA17503000987 40676 cl 644 



Description 
NO-HIT 



NT AA 
NTID AAID — — score 
Length Leng til 



probability 



"j [1620 | [5392 jpi~||60 | 



ORF Name 



1AI7503000987 4078305 c3 927 



Description 
NO-HIT 



NTID 



a7vt ^ NT AA 

AAID — i — , , score 

LengthLength 



probability 



J [1621 | |b393 ip^g— 1|41 | 



ORF Name 



UI7503000987 4078375 ti 6 



Description 
NO-HIT 



ORF Name 



^17503000987 4078518 tl 143 



NTID 



MTO NT AA 

^ Len^thLen^th ^^ 



probability 



][ 



F3"9T" 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



][ 



TS2T" 



[5395 |[843~||280 ||521 [[Tr^e^u" 



Description 

Sp : [LN : YXEK_BACSU] [AC:P54950] [GN : YXEK : LP9C] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
49.3 KD PROTEIN IN IDH-DEOR INTERGENIC REGION] [SP:P54950] [DB : swissprot] 
>pir: [LN:E70075] [AC:E70075 ] [PN : monooxygenase homolog yxeK] [GN:yxeK ] 
[CLmitrilotriacetate monooxygenase] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell84677:g2636498] [LN : BSUB0021] [AC : Z99124 : AL009126] [GN: yxeK] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DEiBacillus subtilis complete genome (section 
21 of 21): from 3999281to 4214814.] [NT:similar to monooxygenase] [SP:P54950] 
[LE:60328] [RE:61653] [DI : complement] >gp : [GI : dl008923 :gl408496] [LN:D45912] 
[AC:D45912] [GN:yxeK] [ORiBacillus subtilis] [SR:Bacillus subtilis (straimBGSC 1A1 
(Marburg 168; trpC2)) DNA] [DB :genpept-bctl] [DEiBacillus subtilis genome sequence 
between the iol and hut operon, partial and complete cds.] [NT : homologous to dszA gene 
product of Rhodococcus sp.] [LE:9823] [RE: 11148] [Dlidirect] 
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ORF Name 



NT AA 
NTID AAID — , T — scor e 
LengthLength 



03000987 4181527 cl 720 



1624 | [5396 ||447 1 1148 |p~ 



probability 
0.0045 



Description 

sp: [LN:V17_BPT7] [AC : P03781] [GN:1.7] [OR : BACTERIOPHAGE T7] [DE : GENE 1.7 PROTEIN] 
[SP:P03781] [DB:Swissprot] >pir : [LN: W1BP77] [AC : G43002 : G43004 : S42296 : A04406 ] [PNigene 
1.7 protein] [GN:1.7 ] [CL:phage T7 gene 1.7 protein] [ORiphage T7] [DB:pirl] 
[MP:20. 42-21. 89 ] >gp : [GI : gl5512] [LN:PET7XX] [AC:V01127] [OR : Bacteriophage T7] 
[DB :genpept-phg] [DE:Left end of bacteriophage T7 genome. Includes the reading frames 
ofthe genes 0.3, 0.4, 0.5, 0.6, 0.65, 0.7, 1, 1.1, 1.2, 1.3 (earlyproteins) and 1.4, 
1.5, 1.6, 1.7, 2, 2.5, 2.8, 3, 3.5, 4A and 4B(late proteins). Gene 1 is the T7 RNA 
polymerase.] [NT:1.7 protein] [SP:P03781] [LE:8166] [RE:8756] [Dlidirect] 
>gp: [GI :gl5576] [LN: T7CG] [AC : V01146 : J02518 : X00411] [OR : Bacteriophage T7] 
[DB:genpept-phg] [DE:Genome of bacteriophage T7 . ] [NT:gene 1.7] [SP:P03781] [LE:8166] 
[RE: 8756] [DI:direct] 



ORF Name 



NTID 



_ _ NT AA 

AAID — . — , . score 

LengthLength 



IA17503000987 4181577 c3 955 



1625 | [5397 |p5T]|4i6 | 



probability 
3 . 9e-44 



Description 

pir: [LN:H70427] [AC:H70427 ] [PN : replicative DNA helicase] [GN:dnaB ] [CL:phage P22 
gene 12 protein] [OR:Aquifex aeolicus] [DB:pir2] >gp : [GI :g2983861] [LN : AE000742] 
[AC:AE000742 :AE000657] [PN : replicative DNA helicase] [GN:dnaB] [OR:Aquifex aeolicus] 
[DB:genpept-bct2] [DE:Aquifex aeolicus section 74 of 109 of the complete genome.] 
[LE:2249] [RE: 3655] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — _ T — , , score 
LengthLength 



probability 



117503000987 4329063 c3 900 



] [1626 | [5398 |[T^T]|4i4 | [1786 ||4 . le-184 



Description 

gp: [GI:dl039005:g4512402] [LN : AB017508] [AC : AB017508] [GN : tuf A] [OR:Bacillus 
halodurans] [SR:Bacillus halodurans (strain : C-125) DNA] [DB :genpept-bctl] [DE:Bacillus 
halodurans C-125 genomic DNA, 32 kb fragment, completecds . ] [NT: tuf A homologue 
(identity of 91% to B. subtilis )] [LE: 11150] [RE: 12340] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — . — . score 
LengthLength 



probability 



IAI7503000987 4334510 c2 822 



Description 
NO-HIT 



][ 



|5399 |pT5 | 



ORF Name 



[AI7503000987 4398453 t3 561 



NTID 



NT AA 
AAID — — .score 
LengthLength 



[5400 |pT-j|158 | p 



7T 



probability 
l4.5e-24 



Description 

sp : [LN : NTAA_CHEHE] [AC:P54989] [GN : NTAA : NMOA] [OR : CHELATOBACTER HEINTZII] 
[EC: 1.14. 13.-] [DE : MONOOXYGENASE COMPONENT A) (NTA-MO A)] [SP:P54989] [DB : swissprot] 
>gp: [GI :glll9211] [LN : CBYNMOB] [AC:L49438] [GN:nmoA] [FN : component A of NTA 
monooxygenase] [OR : Chelatobacter heintzii] [DB :genpept-bctl] [DE : Chelatobacter heintzii 
NTA monooxygenase component B (nmoB) , NTAmono oxygenase component A (nmoA) , regulatory 
protein (nmoR) andtransposase (nmoT) genes, complete cds.] [LE:1323] [RE:2684] 
[DIrdirect] >gp : [GI :gl480205] [LN: CHU39411] [AC:U39411] [ PN : NTA monooxygena s e component 
A] [GN:ntaA] [OR : Chelatobacter heintzii] [SR : Chelatobacter heintzii strain=ATCC 29600] 
[DB :genpept-bctl] [DE : Chelatobacter heintzii nitrilotriacetate monooxygenase 
genes, putative regulatory protein (ntaR) , NTA monooxygenase component A(ntaA) and 
component B (ntaB) genes, complete cds.] [LE:1123] [RE: 2484] [DI : complement] 
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NT AA 

QRF Name ?™ AAID Le ~ thLe — th ^£2£e probability 



|Ai7503000987_4460b2_cl_748 | [1629 | [5401 || 590 | |229 | [ 1155 || 2 . be-118 

Description 

sp: [LN:TRA2_STAAU] [AC:P19380] [OR : STAPHYLOCOCCUS AUREUS] [DE : TRANS POSASE FOR INSERTION 
SEQUENCE -LIKE ELEMENT IS431MEC] [SP:P19380] [DB : swissprot] >pir : [LN: S12093] 
[AC:S12093 : JU0116 ] [PNrprobable transposase] [OR : Staphylococcus aureus] [DB:pir2] 
>gp: [GI :g46602] [LN: SAIS431M] [AC : X53818 : M1843 8] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE:S. aureus IS431mec gene associated with methicillin resistance.] 
[NTiputative transposase (AA 1 - 224)] [SP:P19380] [LE:272] [RE:946] [DIrdirect] 
>gp: [GI:el237900:g2791991] [ LN : SAMECAR1 I ] [AC:Y14051] [PN:putative transposase] 
[OR: Staphylococcus aureus] [DB : genpept- bet 1] [DE : Staphylococcus aureus mecA, mecRl, 
mecl genes and ORF168, ORF142 , ORF44 , ORF145 and ORF224 . ] [NT:ORF224] [LE:8096] 
[RE:8770] [DI -.direct] >gp : [GI : dl046034 : g5360858] [LN:D86934] [AC:D86934] 
[PN: transposase for insertion sequence- like element] [OR : Staphylococcus aureus] 
[SR: Staphylococcus aureus (strain :N315) DNA, clone_lib : library of N31] [DB:genpept] 
[DE: Staphylococcus aureus genes, mec region, partial and complete cds . ] [NT : ORF N062] 
[LE:48054] [RE:48728] [DI:direct] >gp : [GI : dl046044 : g5360868] [LN:D86934] [AC:D86934] 
[PN: transposase for insertion sequence- like element] [OR : Staphylococcus aureus] 
[SR: Staphylococcus aureus (strain : N315) DNA, clone_lib : library of N31] [DB:genpept] 
[DE : Staphylococcus aureus genes, mec region, partial and complete cds.] [NT :ORF N070] 
[LE:53400] [RE:54074] [DI:direct] 



NT AA 

ORF Name NTID AAID — , — , . score probabili ty 
Length Length L - 



AI7S03000987_472S006_c3_98I 

Description 

NO-HIT 



[1630 [ |5402 ||5T7 | [178 | 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

|A17S030009B7_4727203_c3_92B | [1631 | |b403 |p^~| |122 |p || 0.0J2 "[ 

Description 

gp: [GI:dl004088:g505029] [LN:MUSMNS1] [AC:D14849] [PN : meiosis- specif ic nuclear 
structural protein 1] [OR:Mus musculus] [SR:Mus mucsulus (strain ddY) testis pachytene 
spermatocyte cDNA t] [DB : genpept - rod] [DE:Mouse mRNA for meiosis-specif ic nuclear 
structural protein 1(MNS1), complete cds.] [LE:180] [RE:1655] [DI:direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



1175030009 87 473bB33 t2 359 



[5404 |[7Tl~||23g ]|454 ||S.0e-44 



Description 

sp: [LN:YAAG_BACSU] [AC:P37530] [GN: YAAG] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 24.1 
KD PROTEIN IN SERS-DNAZ INTERGENIC REGION] [SP:P37530] [DB : swissprot] >pir : [LN : S66045] 
[AC:S66045:G69736 ] [PN : deoxypurine kinase subunit homolog yaaG] [GN:yaaG ] 
[CL: Lactobacillus acidophilus deoxyadenosine kinase] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl005793:g467405] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB : genpept -bet 1] [DE : B . 
subtilis DNA, 180 kilobase region of replication origin.] [LE: 86746] [RE: 87369] 
[DI: complement] >gp: [GI : el 181 94 8 :g2 632282] [LN: BSUB0001] [AC : Z99104 : AL009126] [GN:yaaG] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete 
genome (section 1 of 21) : from 1 to213080.] [NT: similar to deoxypurine kinase subunit] 
[SP:P37530] [LE:23144] [RE:23767] [DI : complement] 
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ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



F 



17b03000987 c3 967 



] [1633 | [5405 ||509 ||202 1 1210 ||4.1e-17 



Description 

gp: [GI:e247154:gl926352] [LN : LBPHIG1E] [AC:X98106] [PN:minor capsid protein] 
[GN:Rorf204] [OR : Bacteriophage phigle] [DB :genpept-phg] [DE : Lactobacillus bacteriophage 
phigle complete genomic DNA. ] [LE: 24554] [RE: 25168] [DI : complement] 



ORF Name 



NT 



AA 



NTID AAID — _ T — _ score probability 
LengthLength c JL 



^7503000987 4741077 c3 952 



] 



HF1T 



[5406 |[5gT""]j22fl ||23fl ||4.5e-20 



Description 

gp: [GI:e247139:gl926334] [LN : LBPHIG1E] [AC:X98106] [GN:Rorf242] [OR : Bacteriophage 
phigle] [DB :genpept-phg] [DE Lactobacillus bacteriophage phigle complete genomic DNA.] 
[LE : 37600] [RE : 38328] [DI : complement] 



ORF Name 



NTID AAID _ — , — 



[AI750 



3000987 4876967 C3 913 



BT0T" 



NT AA 
LengthLength 



T9T" 



score probability 
1168 ||1.2e-12 



Description 

pir: [LN:G69824] [AC:G69824 ] [PN : conserved hypothetical protein yhdA] [GN:yhdA ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82923 :g2633257] [LN: BSUB0005] 
[AC:Z99108 :AL009126] [GN:yhdA] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 5 of 21): from 802821 tol011250.] 
[NT:similar to hypothetical proteins] [LE:207082] [RE:207606] [DI:direct] 
>gp: [GI:ell82935:g2633269] [LN : BSUB0006] [AC : Z99109 : AL009126] [GN:yhdA] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
6 of 21): from 999501 tol209940.] [NT:similar to hypothetical proteins] [LE:10402] 
[RE:10926] [DI:direct] >gp : [GI : e324946 :g2226142] [LN : BSY14079] [AC:Y14079] 
[PN: hypothetical protein] [GN:yhdA] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis chromosomal DNA, region 75 degrees: glpPFKDoperon and 
downstream.] [NT : similarity to the hypothetical protein YIEF from] [LE: 10050] 
[RE: 10574] [DIrdirect] 



ORF Name 



NTID 



A7VTT . NT AA 

AAID — — _ score 

Length Length 



probability 



1AI7503000987 4882760 c3 1001 



Description 
NO-HIT 



[5408 |p^~| |50 | 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IA17503000987 4885075 El 559 



[1^37 | [5409 |p^— 1|225 | [753~ ||5 . 4e-79 



Description 

Sp : [LN : YAAF_BACSU] [AC:P37529] [GN : YAAF] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 25.4 
KD PROTEIN IN SERS-DNAZ INTERGENIC REGION] [SP:P37529] [DB : swissprot] >pir : [LN : S66044] 
[AC:S66044 :F69736 ] [PN : deoxypurine kinase subunit homolog yaaF] [GN:yaaF ] 
[CL: Lactobacillus acidophilus deoxyadenosine kinase] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl005792:g467404] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] [DB : genpept-bctl] [DE : B . 
subtilis DNA, 180 kilobase region of replication origin.] [LE: 86096] [RE: 86749] 
[DI: complement] >gp : [GI : ell81947 : g2632281] [LN: BSUB0001] [AC : Z99104 : AL009126] [GN:yaaF] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 1 of 21): from 1 to2l3080.] [NT:similar to deoxypurine kinase subunit] 
[SP:P37529] [LE:22494] [RE:23147] [D I : complement] 



397 



NT AA 



ORF Name NT ID AAID Le ^ t frLength 3core probability 



|Al7B030009B7_4a9m7_clJJ46 | [1538 | [ 5410 |p7T^ |5uT | pFT|| 7 . 6e- 160 1 

Description 

pir: [LN:H69780] [AC:H69780 ] [PN : antibiotic transport-associated protein homolog ydfJ] 
[GNrydfJ ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : dl020130 :gl881350] [LN: AB001488] 
[AC.-AB001488] [GN:ydfJ] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain:168) DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis genome sequence, 148 kb sequence of the 

regionbetween 35 and 47 degree.] [NT: PROBABLE TRANSPORT PROTEIN, SIMILAR TO ANTIBIOTIC] 
[LE:122935] [RE:125109] [DI:direct] >gp : [GI:ell82509:g2 632843] [LN:BSUB0003] 
[AC:Z99106 :AL009126] [GNrydfJ] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 3 of 21): from 402751 to611850.] 
[NT: similar to antibiotic transport-associated protein] [LE: 186681] [RE: 188855] 
[DI: direct] 



NT AA 

ORF Name NTID AAID Le ^ thLe ~ th score probability 



H7503000987 4501703 c2 792 



T£T9~ 



[5411 |pp— |ji69 ||95 ||0,0031 



Description 

pir: [LN:D69887] [AC:D69887 ] [PN : conserved hypothetical protein ynaD] [GN:ynaD ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : gl750115] [LN:BSU66480] [AC:U66480] [PN: YnaD] 
[GN:ynaD] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis SpoVK (spoVK) , 
YnbA (ynbA) , YnbB (ynbB) , GlnR(glnR), glutamine synthetase (glnA) , YnaA (ynaA) , YnaB 
(ynaB) , YnaC(ynaC), YnaD (ynaD), YnaE (ynaE) , YnaF (ynaF) , YnaG (ynaG) , YnaH(ynaH), 
Ynal (ynal) , YnaJ (ynaJ) , xylan beta-1 , 4-xylosidase (xynB) , xylose repressor (xylR) , 
xylose isomerase (xylA) , xylulose kinase (xylB) , YncB (yncB) , YncC (yncC) , YncD (yncD) 
and YncE (yncE)genes, complete cds . ] [LE:9169] [RE:9681] [DIrdirect] 

>gp: [GI:ell83411:g2634136] [LN : BSUB0010] [AC : Z99113 : AL009126] [GN:ynaD] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
10 of 21): from 1781201to 2014980.] [NT:similar to hypothetical proteins] [LE:101220] 
[RE: 101732] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — , , score probability 
LengthLength c L - 

|&I7S030009fl7_4946962_c3_907 | [1640 | [ 5412 [ [TuTF] | 355 | p3~ || 1 . Oe-11 | 

Description 

gp: [GI:g3264834] [LN : AF072 541] [AC : AF072541] [PN:xylitol dehydrogenase] [GN:xdh] 
[FN:xylose utilisation] [OR : Galactocandida mastotermitis] [DB :genpept-pln2] 
[EC: 1.1. 1.9] [DE : Galactocandida mastotermitis xylitol dehydrogenase (xdh) gene , complete 

cds.] [NT : a member of the medium chain dehydrogenase] [LE:301:373] [RE : 312 : 1422] 
[DI : direct Join] 



NT AA 

ORF Name NTID AAID — . T — , , score probability 

LengthLength 



Al7503000987_4964677_c2J332 | [1541 | [5413 | [^~[ | 1 4 7 || 257 | 



|4.3e-22 



Description 

pir: [LN:T00179] [AC:T00179 ] [PN : ssDNA- binding protein] [OR : Staphylococcus aureus phage 
phi PVL] [DB:pir3] >gp : [GI : dl032880 :g3341953] [LN : AB009866] [AC : AB009866] [PN:ssDNA 
binding protein] [OR : bacteriophage phi PVL] [SR bacteriophage phi PVL 
(specif ic_host staphylococcus aureus ATC] [DB : genpept-phg] [DE : Bacteriophage phi PVL 
proviral DNA, complete sequence.] [NT:orf 45] [LE:34657] [RE:35127] [DI:direct] 



398 



NT AA 

ORF Na me NTID AAID „ — - _ — ,, score probability 
LenqthLength ^ 



Al7b030009B7_b0«009a_c2_a0tf | [1542 | [5414 ||744 | |247 | pg~ ||6 . 5e- 51 



Description 

sp : [LN : YBFT_BACSU] [AC:031458] [GN : YBFT] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 27.3 
KD PROTEIN IN GLTP-CWLJ INTERGENIC REGION] [SP:031458] [DB : swissprot] >pir : [LN: E69750] 
[AC:E69750 ] [PN : glucosamine- 6 -phosphate isomerase homolog ybfT] [GNiybfT ] 
[CL: glucosamine- 6 -phosphate isomerase] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl034099:g3599658] [LN : AB006424] [AC : AB006424 ] [GNrybfT] [ORrBacillus subtilis] 
[SRrBacillus subtilis (strain:168) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis genomic 
DNA, 70 kb region between 17 and 23degree.] [LE: 59813] [RE: 60562] [DI : complement] 
>gp: [GI:ell82188:g2632522] [LN : BSUB0002] [AC : Z99105 : AL009126] [GN:ybfT] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
2 of 21): from 194651 to415810.] [NT:similar to glucosamine -6 -phosphate isomerase] 
[SP:031458] [LE: 62162] [RE: 62911] [DI : complement] 

NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength c L 



[M75030009fl7_5i25076_l:3_531 | [1543 | [ 5415 || 810 "] pg9 || 873 | p.3e-87 

Description 

sp: [LN:YWFI_BACSU] [AC:P39645] [GN : YWFI : IPA- 8 7R] [OR:BACILLUS SUBTILIS] 
[DE: HYPOTHETICAL 29.5 KD PROTEIN IN ROCC-PTA INTERGENIC REGION] [SP:P39645] 
[DB: swissprot] >pir: [LN:S39742] [AC : S39742 : B70056 ] [PN:ywfI protein hypothetical 

protein ipa-87r] [GN:ywfI ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : g414011] 
[LN:BSGENR] [AC:X73124] [GN:ipa-87r] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis genomic region (325 to 333).] [SP:P39645] [LE:90200] [RE:90964] 
[DI: complement] >gp: [GI :ell86267:g2 636303] [LN: BSUB0020] [AC : Z99123 : AL009126] [GN:ywfI] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate gene name: ipa-87r; 

similar to] [SP:P39645] [LE:66995] [RE:67759] [DI:direct] 

NT AA 

ORF Name NTID AAID — — , , scor e probability 
LengthLength 



I7503000987_5251588_ci_747 | [1644 | [5416 | 193 | | 87 || 0 . 0010 



Description 

gp: [GI:gl079814] [LN:S79230] [AC:S79230] [PN:BM1P1] [GN:CYP106] [OR:Bacillus 
megaterium] [DB :genpept-bct2] [DE : CYP106-BM1P2 orf . . . CYP106=P450BM-1 orf {regulatory 
regions} [Bacillus megaterium, mRNA Partial, 3 genes, 1400 nt].] [NT:positive 
transcription factor involved in] [LE:763] [RE: 1059] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ — , score probability 
Length Length *- 

&:t7S030009fl7_S2BflSiS_c3_96B | [1645 | [5417 | 437 | p3F]| 2 . le-132 — | 



Description 

sp: [LN:TERL_BPSPP] [AC:P54308] [GN:2] [OR : BACTERIOPHAGE SPP1] [DE : TERMINASE LARGE 
SUBUNIT (G2P) ] [SP:P54308] [DB : swissprot ] >pir : [LN : S24451] [AC:S24451 ] [PN : terminase] 
[0R:phage SPP1] [DB:pir2] >gp : [GI : e244468 : g2764840] [LN: BSPP1GENM] [AC:X97918] 
[OR bacteriophage SPP1] [DB : genpept -phg] [DE : Bacteriophage SPP1 complete nucleotide 
sequence.] [NT:gene2] [SP:P54308] [LE:307] [RE:1575] [DI: direct] >gp : [GI :gl5466] 
[LN:NCSPP116] [AC : X56064 : S35313 ] [PN : terminase] [GN:SPP1 gene 2] [OR : Bacteriophage 
SPP1] [DB: genpept -phg] [DE : B . subtilis bacteriophage SPP1 genes for terminase and 
portalprotein.] [SP:P54308] [LE:633] [RE: 1901] [DI: direct] 
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ORF Name 



NT 



AA 



^ AAI2 Le ^thLe^th ^g probability 



|AI7503000987_b370450_t3_432 



1645 | |541B |p3T]|427 | pTT5~||3 . 6e-27 



Description 

pir: [LN:A69256] [AC:A69256 ] [PN : hypothetical protein AF0049] [OR : Archaeoglobus 
fulgidus] [DB:pir2] >gp : [GI :g2650605] [LN : AE001103] [AC : AE001103 : AE000782] [PN:A. 
fulgidus predicted coding region AF0049] [GN:AF0049] [OR : Archaeoglobus fulgidus] 
[DB:genpept-bct2] [DE : Archaeoglobus fulgidus section 4 of 172 of the complete genome.] 
[NT: hypothetical protein; identified by GeneMark;] [LE:7830] [RE:9068] [DI : complement] 



ORF Name 



NT 



AA 



NTID AAID . — , _ 



LengthLength 



score 



^17503000987 56626 c2 855 



1547 | [5415 |p5~|[XTT 



probability 
0.0020 



Description 

gp: [GI:gl209829] [LN : BBU45421] [AC:U45421] [GN : REP+] [ORrBorrelia burgdorferi] [SR:Lyme 
disease spirochete strain=297] [DB :genpept-bctl] [DErBorrelia burgdorferi 2.9-1 locus, 
ORF 5-8, ORF-A-D, REP+ , REP- , andlipoprotein (LP) genes, complete cds . ] [NT: repeat 
motif -containing gene] [LE:4602] [RE:5156] [DI:direct] 



ORF Name 



NTID AAID — — score probability 



|A17b03000987 58 7 811 tl 125 



Description 
NO-HIT 



NT AA 
LengthLength 
[5420 |pT5 | fTT¥ 



ORF Name 



NTID 



AAID 



IAI7503000987 6055525 cl 755 



NT AA 
LengthLength 

[5421 [ |S5 



score probability 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID AAID Le ^ thLe ~ th score probability 



AI7503000987 6149152 c3 884 



] 



[5422 |pTgT]|462 ||1543 | 



|2.3e-158 



Description 

sp : [LN : RADA_BACSU] [AC:P37572] [GN : RADA : SMS] [OR:BACILLUS SUBTILIS] [DE:DNA REPAIR 
PROTEIN RADA HOMOLOG (DNA REPAIR PROTEIN SMS HOMOLOG) ] [SP:P37572] [DB : swissprot] 
>pir: [LN:S66116] [AC : S66116 : 140509 :A69709 ] [ PN : DNA repair protein homolog sms] [GNrsms 
] [CL :DNA repair protein sms] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl005863 :g467475] [LN:BAC180K] [AC:D26185] [PN:unknown] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (sub_species : Marburg, strain: 168) DNA] [DB : genpept-bctl] [DE:B. 
subtilis DNA, 180 kilobase region of replication origin.] [LE: 169695] [RE: 171071] 
[DI:direct] >gp : [GI :ell82020:g2 632354] [LN: BSUB0001] [AC : Z99104 : AL009126] [PN: DNA 
repair protein homolog] [GN:sms] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus 
subtilis complete genome (section 1 of 21) : from 1 to213080.] [NT : alternate gene name: 
yacJ] [SP:P37572] [LE:106093] [RE:107469] [DIrdirect] 



ORF Name 



AI7503000987_6250050_r2_233 

Description 

NO-HIT 



„™ NT AA 
NTID AAID — . T — , , score 
LengthLength 



probability 



] I 1651 1 I 5423 IF^Hl 48 I 



400 



ORF Name 

[A17b03000987_6406337_t3_448 



NTID 



NT AA 
AAID — , -j- — score 
LengthLength 



probability 



] [1552 | [5424 |pi0"]|59 | 



Description 
NO-HIT 



ORF Name 



NTID 



_ _ NT AA 

AAID — , — , , score 

LengthLength 



E 



17503000987 6407136 c2 840 



probability 
|1.6e-5S 



Description 

sp: [LN:SIZ_BPSPP] [AC:P54309] [GN:6:SIZ] [OR : BACTERIOPHAGE SPP1] [DE : PORTAL PROTEIN 
(PORTAL VERTEX PROTEIN) (GP6) ] [SP:P54309] [DB : swissprot] >pir : [LN : S21805] 
[AC:S21805:S24455:S36725 ] [PN:portal protein:gene 6 protein] [GN:6 ] [OR:phage SPP1] 
[DB:pir2] >gp : [GI : e244702 : g2764847] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB:genpept-phg] [DE : Bacteriophage SPP1 complete nucleotide sequence.] [NT: gene 6] 
[SP:P54309] [LE:2334] [RE:3845] [DIrdirect] >gp : [GI : gl5470] [LN:NCSPP116] 
[AC:X56064 :S35313] [PN:portal protein] [GN:SPP1 gene 6] [OR : Bacteriophage SPP1] 
[DB:genpept-phg] [DE : B . subtilis bacteriophage SPP1 genes for terminase and 
portalprotein.] [SP:P54309] [LE:2660] [RE:4171] [DI: direct]. 



ORF Name 



NTID 



„ TO NT AA 

AAID — » _ — _ score 

LengthLength 



probability 



^17503000957 6443763 c2 769 



[1654 | [5426 |p3~| |135 | pg~~||2 . Oe-26 



Description 

pir: [LN:C69742] [AC:C69742 ] [PN : conserved hypothetical protein yazC] [GN:yazC ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell82028 : g2632362] [LN:BSUB0001] 
[AC:Z99104 :AL009126] [GNryazC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1 ] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [NT:similar 
to hypothetical proteins] [LE: 114851] [RE: 115282] [DI:direct] 



ORF Name 



NT AA 

NTID AAID — _ T . , score 

LengthLength 



probability 



IAI7503000987 661062 cl 641 



■j [1655 | [5427 |p¥u"] |479 | p3FF]|5 . 2e-161 



Description 

sp: [LN:SYC_BACSU] [AC:Q06752] [GN : CYSS : SPNA] [OR:BACILLUS SUBTILIS] [EC : 6 . 1 . 1 . 16] 
[DE: (CYSRS)] [SP:Q06752] [DB : swissprot] >pir: [LN:C53402] 

[AC:C53402:S44447:S66123:B69612 ] [PN: cysteine- -tRNA ligase, cysS : cysteinyl- tRNA 
synthetase cysS] [GN:cysS ] [CL: cysteine- -tRNA ligase] [OR: Bacillus subtilis] 
[EC:6. 1.1.16] [DB:pirl] >gp : [GI :dl 005870 :g4 67482] [LN:BAC180K] [AC:D26185] 
[PN: cysteinyl -tRNA synthetase] [GN:cysS] [OR: Bacillus subtilis] [SR: Bacillus subtilis 
(sub_species : Marburg, strain:168) DNA] [DB : genpept -bet 1] [DE : B . subtilis DNA, 180 
kilobase region of replication origin.] [LE:177049] [RE:178449] [DI:direct] 
>gp: [GI:g289284] [LN : BACGLUSYN] [AC:L14580] [PN : cysteinyl -tRNA synthetase] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain 168T) DNA] [DB : genpept -bctl] [DE:Bacillus 
subtilis glutamyl-tRNA transferase (gltX) , serineacetyltransf erase (cysE) , and 
cysteinyl -tRNA synthetase (cysS)genes, complete cds • s . ] [LE:4109] [RE:5509] [DI:direct] 
>gp: [GI:g499303] [LN:BSCTS] [AC:X73989] [PN : cysteine- -tRNA ligase] [GN : SPNA/ CYSS] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [EC : 6 . 1 . 1 . 16] [DE :B . subtilis gene for 
cysteinyl -tRNA synthetase.] [SP:Q06752] [LE:321] [RE: 1721] [DI:direct] 
>gp: [GI:ell82027:g2632361] [LN : BSUB0001] [AC : Z99104 : AL009126] [PN: cysteinyl -tRNA 
synthetase] [GN:cysS] [OR:Bacillus subtilis] [DB :genpept-bctl] [EC : 6 . 1 . 1 . 16] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] 
[NT alternate gene name: snpA] [SP:Q06752] [LE: 113447] [RE: 114847] [DI: direct] 



401 



ORF Name 



NT 



AA 



NTID AAID Le ^thLe^th ^^ probability 



|Al7b030009B7 fe72b70 7 c2 770 



[1555 | [5428 ||53^ — 1 |183 | prr" ||1 . 7e-29 



Description 

sp: [LN:YACP_BACSU] [AC:P37574] [GN: YACP] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 19.7 
KD PROTEIN IN CYSS 3 1 REGION] [SP:P37574] [DB : swissprot] >pir : [LN : S66125] 
[AC:S66125 :H69741 ] [PN : conserved hypothetical protein yacP] [GNryacP ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005872 :g467484] [LN:BAC180K] [AC:D26185] [PN:unknown] 
[OR:Bacillus subtilis] [SRrBacillus subtilis (sub_species : Marburg, strain:168) DNA] 
[DB:genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:179624] [RE:180136] [DI: direct] >gp : [GI :ell82030:g2 632364] [LN : BSUB0001] 
[AC:Z99104 :AL009126] [GNryacP] [FNrunknown] [OR:Bacillus subtilis] [DB : genpept -bet 1 ] 
[DE:Bacillus subtilis complete genome (section 1 of 21): from 1 to213080.] [NTisimilar 
to hypothetical proteins] [SP:P37574] [LE: 116022] [RE: 116534] [DIrdirect] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



E 



17503000987 6757338 c3 $06 



probability 
[5429 ||i085~||362 | | ii23 || 1 . 7e-il4 



Description 

pir: [LN:H69750] [AC:H69750 ] [PN : branched- chain amino acid aminotransferase homolog 
ybgE] [GNrybgE ] [CL :branched-chain-amino-acid transaminase BAT1] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl034103 : g3599662] [LN : AB006424] [AC : AB006424] [GNrybgE] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain: 168) DNA] [DB :genpept-bctl] 
[DE: Bacillus subtilis genomic DNA, 70 kb region between 17 and 23degree.] [LE: 62005] 
[RE:63075] [DI: direct] >gp : [GI :ell82191:g2 632525] [LN : BSUB0002] [AC : Z 99105 : AL009126] 
[GNrybgE] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DErBacillus subtilis 
complete genome (section 2 of 21): from 194651 to415810.] [NT:similar to branched- chain 
amino acid] [LE:64355] [RE:65425] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID ^ T _ 

LengthLength 



score 



probability 



U75030009B7 69327b0 ±2 348 



[1658 [ [5430 |p^— j |125 | ||4 . 8e-23 



Description 

pir: [LN:C69906] [AC:C69906 ] [PN : hypothetical protein yojF] [GN : yoj F ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell85419 : g2 634340] [LN:BSUB0011] [AC : Z99114 : AL009126] 
[GN: yoj F] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis 
complete genome (section 11 of 21): from 2000171to 2207900.] [LE:121397] [RE:121747] 
[DI: complement] >gp : [GI :g3169322] [LN : AF026147] [AC : AF026147] [PN : Yoj F] [GN: yoj F] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bct2] [DE:Bacillus subtilis Yoj A 
(yojA) , YojB (yojB), YojC (yojC) , YojD(yojD), YojE (yojE) , YojF (yojF), YojG (yojG) , 
YojH (yojH), Yoj I (yoj I), YojJ (yoj J) , YojK (yojK) , YojL (yojL) , YojM (yojM) , 
YojN(yojN), and YojO (yojO) genes, complete cds; and OdhA (odhA) gene, partial cds . ] 
[LE:3518] [RE:3868] [DI:direct] 



ORF Name 



|AI7503000987_7032188_t2_246 



NTID 



p31 



AAID _ 



NT AA 
LengthLength 
126 | \ZI — 



score probability 



Description 
NO-HIT 



402 



ORF Name 



NT 



AA 



NT ID AAID , 

LengthLength 



score 



F 



17503000987 70fc«7!>l c3 984 



[1550 | [5432 ||945 | [314 ~j |974 | [ 



probability 
|4 . be-98 



Description 

sp: [LN:IOLS_BACSU] [AC:P46336] [GN : IOLS : SS92ER] [OR : BACILLUS SUBTILIS] [DE : IOLS PROTEIN 
(VEGETATIVE PROTEIN 147) (VEG147) ] [SP:P46336] [DB : swissprot] >pir: [LN:D69646] 
[AC:D69646 ] [PN:myo- inositol catabolism iolS] [GNriolS ] [CL : conserved hypothetical 
protein YPL088w] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : dl022457 : g904205] 
[LN:AB005554] [AC : AB005554 : D45242 : D3162 9] [GN:iolS] [OR:Bacillus subtilis] [SR:Bacillus 
subtilis (strain:BGSC 1A1) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis genomic DNA, 36 
kb region between gnt and ioloperons . ] [NT : plausibly involved in inositol catabolism] 
[LE:32539] [RE:33471] [DI : complement] >gp : [GI : ell84703 :g2636524] [LN:BSUB0021] 
[AC:Z99124 :AL009126] [GNriolS] [ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 21 of 21): from 3999281to 4214814.] [NT : alternate 
gene name: yxbF; myo-inositol catabolism] [SP:P46336] [LE:85121] [RE:86053] [DI:direct] 



ORF Name 



|Ai7503000987_7i50287_c2_805 

Description 

NO-HIT 



NTID 



AAID 



NT 



AA 



LengthLength probability 



ORF Name 



11 7 503000987 7 154191 c2 798 



Description 
NO-HIT 



ORF Name 



F 



17503000987 7229580 ci 534 



NTID 



NT 



AA 



AAID , 

LengthLength 

|5434 ||T5u | |49 I 



score 



probability 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



] [1553 | [5435 ||153 |p77""||8 ■ 3e-35 



Description 

sp: [LN:CTSR_BACSU] [AC:P37568] [GN : CTSR] [OR:BACILLUS SUBTILIS] [DE : TRANSCRIPTIONAL 
REGULATOR CTSR] [SP:P37568] [DB : swissprot] >pir : [LN : S66112 ] [AC : S66112 : D6 9610 ] 
[PN: transcription repressor of class III stress genes ctsR] [GN: ctsR ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : dl005859 : g467471] [LN:BAC180K] [AC:D26185] [PN:unknown] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (sub_species : Marburg, strain:168) DNA] 
[DB: genpept-bctl] [DE : B . subtilis DNA, 180 kilobase region of replication origin.] 
[LE:165048] [RE:165512] [DI:direct] >gp : [GI :ell82016:g2 632350] [LN: BSUB0001] 
[AC:Z99104 :AL009126] [PN : transcriptional regulator] [GN: ctsR] [FN:negative regulation 
of class III stress genes] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 1 of 21): from 1 to213080.] [NT:alternate gene name: 
yacG] [SP:P37568] [LE:101446] [RE:101910] [DI:direct] 



ORF Name 



NTID 



„ T „ NT AA 

AAID — — , score 

LengthLength 



[5436 |pn~ ||145 ||158 | [ 



probability 
1.3e-ll 



LAI7 503000987_72 7 52 63_Cl_733 
Description 

gp: [GI:e244713:g2764865] [LN : BSPP1GENM] [AC:X97918] [OR : Bacteriophage SPP1] 
[DB:genpept-phg] [DE bacteriophage SPP1 complete nucleotide sequence.] [NT:gene 17] 
[LE:10063] [RE:10467] [DI:direct] 



403 



ORF Name 



NT ID 



NT AA 
AAID — — . , score 
LengthLength 



probability 



AI7503000987 782byU tl 114 



1555 I [5437 ||171 | [55 |p |[0.044 



Description 

gp: [GI:e289995:gl764015] [LN:CIC0S41] [AC:Z83760] [PN:COS41.5] [ORrCiona intestinalis] 
[DB:genpept-invl] [DE:Ciona intestinalis DNA sequence from cosmid C0S41.] [NT:weak 
similarity to membrane transport proteins] [LE : 11497 : 13519 : 14634] 
[RE : 13190 : 13703 : 14681] [DI : complement Join] 



ORF Name 



NTID 



_ _ -py^ NT AA 

AAID — « — , , score 

LengthLength 



probability 



[M75030005fl7_783425_ci_675 



5573" 



T31T 



FT" 



Description 
NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



IA17503000987 818942 c3 989 



[5439 ||?T7~||158 |pT7T" 



probability 
3.4e-17 



Description 

sp: [LN:OPUC_BACSU] [AC:P46922] [GN:OPUAC] [OR:BACILLUS SUBTILIS] [DE : GLYCINE 
BETAINE- BINDING PROTEIN PRECURSOR] [SP:P46922] [DB : swissprot] >pir : [LN : 14053 7] 
[AC:I40537:F69669 ] [PN:glycine betaine ABC transporter (glycine betaine-binding 
protein) opuAC precursor] [GN:opuAC ] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell82252:g2632586] [LN : BSUB0002] [AC : Z99105 : AL009126] [PN:glycine betaine ABC 
transporter (glycine] [GN:opuAC] [FNrglycine betaine transport (osmoprotection) ] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
2 of 21): from 194651 to415810.] [SP:P46922] [LE:128023] [RE:128904] [DI:direct] 
>gp: [GI:dl009569:gl805372] [LN:D50453] [AC:D50453] [PN:glycine betain-binding protein 
precursor] [GNropuAC] [OR:Bacillus subtilis] [SR:Bacillus subtilis (strain: 168 trpC2) 
DNA] [DB :genpept-bctl] [DE:Bacillus subtilis DNA for 25-36 degree region containing 
theamyE-srf A region, complete cds . ] [LE:2326] [RE:3207] [DI:direct] >gp : [GI :g984805] 
[LN:BSU17292] [AC:U17292] [PN:glycine betaine-binding protein precursor] [GNropuAC] 
[OR:Bacillus subtilis] [DB : genpept-bct2] [DE:Bacillus subtilis ATPase (opuAA) , 
transmembrane protein (opuAB) andglycine betaine-binding protein precursor (opuAC) 
genes, completecds . ] [LE:2332] [RE: 3213] [DI:direct] 



ORF Name 



NTID 



NT AA 
AAID — , — , score 
LengthLength 



IA17503000987 819575 c2 850 



|5440 ||1857 | |618 | [T53~ 



probability 
1 .4e-08 



Description 

gp: [GI:e247163:gl926361] [LN : LBPHIG1E] [AC:X98106] [GN:Rorf372] [OR : Bacteriophage 

phigle] [DB : genpept -phg] [DE : Lactobacillus bacteriophage phigle complete genomic DNA.] 
[LE: 13476] [RE: 14594] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — _ — , score 
LengthLength 



probability 



117503000987 821012 cl 654 



[1559 | [ 5441 ||r75T]| 574 1 1^~|[ 5 . 8e-44 ~ 
Description 

pir: [LN:S49240] [AC:S49240 ] [PN : hypothetical protein 3 (capsulation locus)] 
[OR Haemophilus influenzae] [DB:pir2] >gp : [GI :g547513] [LN: HIACAPIID] [AC:Z37516] 
[OR : Haemophilus influenzae] [DB : genpept-bctl] [DE : Haemophilus influenzae serotype a 
capsulation locus region II DNA.] [NT:orf3] [LE:2838] [RE: 5207] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — — score 
LengthLength 



probability 



AI7503000987_833125_t3_493 

Description 

NO-HIT 



j [1570 | [5442 ||285 | 



404 



ORF Name 



NT AA 
NTI D AAID — , , _ — score 
LengthLength 



probability 



[1671 | [5443 |p^7— 1|128 | p7~ ||1 . 4e-30 



1 



IA17503000987Jj92U42_c3_925 

Description 

sp : [LN : YWDK_BACSU] [AC:P39619] [GN: YWDK: IPA-61D] [OR:BACILLUS SUBTILIS] 
[DE: HYPOTHETICAL 12.0 KD PROTEIN IN UNG-ROCA INTERGENIC REGION] [SP:P39619] 
[DBrswissprot] >pir: [LN:S39716] [AC : S39716 : F70054 ] [PNrywdK protein: hypothetical 
protein ipa-61d] [GNrywdK ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI :g413985] 
[LN : BSGENR] [AC:X73124] [GN:ipa-61d] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:B. subtilis genomic region (325 to 333).] [SP:P39619] [LE:63169] [RE:63510] 
[DI:direct] >gp : [GI:ell86292 :g2636328] [LN : BSUB0020] [AC : Z99123 : AL009126] [GNrywdK] 
[FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate gene name: ipa-61d; 
similar to] [SP:P39619] [LE: 94452] [RE: 94793] [DI : complement] 



ORF Name 



NTID 



7v7vtt ^ NT AA 

AAID — — score 

Length Length 



IAI7503000987 972187 t2 208 



TSY2" 



[5444 |pT^|p07 |p^] [ 



probability 
[3.ie-83 



Description 

gp: [GI:g2689564] [LN:U93688] [AC:U93688] [PN : integrase] [GN:int] [OR : Staphylococcus 
aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus toxic shock syndrome toxin-1 
(tst) , enterotoxin (ent) , and integrase (int) genes, complete cds . ] [NT:similar to 
staphylococcal phage integrase] [LE: 13871] [RE: 15091] [DI: direct] 



ORF Name 



NTID 



MTO NT AA 

AAID — , — . score 

LengthLength 



probability 



IAI750300098 7 97655 7 7 c3 949 



] I 1675 | | 5445 H 804 IP 67 |p2g~||5.4e-19 



Description 

gp: [GI:g928839] [LN : BK5TATTP] [AC:L44593] [FN : unidentified] [OR : Lactococcus lactis 
phage BK5-T] [SR bacteriophage BK5-T DNA] [DB :genpept-phg] [DE : Bacteriophage BK5-T 
ORF»410, 3' end pf cds, 20 ORFs, repressorprotein, and Cro repressor protein genes, 
complete cds, ORF70'gene, 5' end of cds.] [NT:ORF266; putative] [LE:14882] [RE:15682] 
[DI : direct] 



ORF Name 



NTID 



_ _ _ _ NT AA 

AAID — — , score 

LengthLength 



probability 



IAI7503000987 977181 cl 702 



1674 | [5445 ||T5T5 [ |59 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



IAI7503000987 978377 ci 589 



1575 | [5447 ||254 1 187 ||75 ||0 . 0057 



Description 

sp: [LN:VG05_VACCC] [AC:P21026] [GN: G5R] [OR:VACCINIA VIRUS] [SR : COPENHAGEN, ] 
[DE : PROTEIN G5] [SP:P21026] [DB : swissprot] >pir : [LN : A42512 ] [AC:A42512 ] [PN:G5R 
protein] [OR: vaccinia virus] [DB:pir2] >gp : [GI :g335417] [LN:VACCG] [AC:M35027] 
[OR: Vaccinia virus] [SR: Vaccinia virus (strain Copenhagen) DNA, clone VC-2] 
[DB:genpept-vrl] [DE:Vaccinia virus, complete genome.] [NT :G5R; putative] [LE:75218] 
[RE: 76522] [DI:direct] 



405 



ORF Name 



NT AA 
NTID A AID — « _ — ^ score 
LengthLength 



probability 



IA17503000987 9819392 c3 908 



] [1576 | [5448 ||789 | [262 ||I7g— ,4e-13 



Description 

gp: [GI:gl463023] [LN: CELF45E1] [AC:U28732] [GN:F45E1.3] [OR : Caenorhabditis elegans] 
[SR:Caenorhabditis elegans strain=Bristol N2] [DB : genpept- invl] [DE : Caenorhabditis 
elegans cosmid F45E1.] [LE : 13873 : 15298 : 15662 : 15851] [RE : 14009 : 15615 : 15806 : 16024] 
[DI : direct Join] 



ORF Name 

[Ar7503000987_9924055_c2_765 

Description 

NO-HIT 

ORF Name 



NTID 



AAID 



TZTT 



NT AA 
LengthLength 



score probability 



7T" 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



[AI7503000987 9926903 c2 83b 



][ 



17578" 



[5450 |pTT9~[|102 ||79 [[0.0032 



Description 

pir: [LN:F45681] [AC:F45681 ] [PN:orf 61.2] [OR:phage T2] [DB:pir2] >gp : [GI : g298525] 
[LN:S57515] [AC:S57515] [GN:orf 61.2] [OR : coliphage T2] [DB :genpept-phg] [DE:orf 61. 
{intergenic region between 41 and 61} [bacteriophage T2, Genomic, 323 nt] . ] [NT : This 
sequence comes from Fig. A3.] [LE:12] [RE:323] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID — , 



Length Length 



score 



AI7503000987 9940b2 C3 979 



probability 
■] [1679 | [5451 ||1536"[|511 ||130 ||0. 00012 



J 



Description 

gp: [GI :g4049717] [LN: AF063866] [AC : AF063866] [PN : ORF MSV156 hypothetical protein] 
[GN:MSV156] [OR : Melanoplus sanguinipes entomopoxvirus] [DB : genpept-vrl] [DE :Melanoplus 
sanguinipes entomopoxvirus, complete genome.] [LE: 140126] [RE: 143509] [DI:direct] 



ORF Name 



NTID 



_ NT AA 
AAID — _ — score 
LengthLength 



^7503000988 22939705 c3 35 



[5452 |p2~ ||133 |p" 



probability 
0 . 00020 



Description 

gp: [GI:g2689564] [LN:U93688] [AC:U93688] [PN : integrase] [GN:int] [OR : Staphylococcus 
aureus] [DB : genpep t - be t 2 ] [DE : Staphylococcus aureus toxic shock syndrome toxin- 1 
(tst) , enterotoxin (ent) , and integrase (int) genes, complete cds . ] [NT:similar to 
staphylococcal phage integrase] [LE: 13871] [RE: 15091] [DI: direct] 



ORF Name 



IAI 7 503000988 235762 12 11 



NT AA 

NTID AAID — _ — score 

LengthLength 



probability 



Description 
NO-HIT 



ORF Name 



^17503000988 23626577 c2 32 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



[5454 |pTu~|[69 ||56 [[0.0035 



Description 

gp: [GI :dl009788 :g829068] [LN : SHFORF] [AC:D50601] [PN:ORF10] [OR:Shigella sonnei] 
[SR: Shigella sonnei (individual_isolate HW3 83) DNA, clone pJK1142] [DB : genpept-bctl] 
[DE:Shigella sonnei DNA for 26 ORFs, complete cds.] [LE:5933] [RE:6628] [DI:direct] 



406 



ORF Name 

|A17b030009aB_24^t)70b^_tl_l 



NT ID 



NT AA 
AAID — , _. — . , score 
LengthLength 



probability 



[1683 | |54Sb |pT-| p 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — 4_ i_ t — , , score 
Length Length 



IAI7503000988 24642202 ci 30 



1 



1684 



probability 
5.8e-82 



Description 

gp: [GI:g2689554] [LN:U93688] [AC:U93688] [OR : Staphylococcus aureus] [DB :genpept-bct2] 
[DE: Staphylococcus aureus toxic shock syndrome toxin- 1 ( tst) , enterotoxin (ent) , and 
integrase (int) genes, complete cds . ] [NT:orf7] [LE:6109] [RE: 6708] [DI : complement] 



ORF Name 



NTID 



„ TO NT AA 

AAID — . — , , score 

LengthLength 



probability 



[A17503000988 2S663587 c2 33 



[1685 | [5457 |[g75~| | 291 ||327""|| j> . 6e-29 — | 
Description 

gp: [GI:g2689560] [LN:U93688] [AC:U93688] [OR : Staphylococcus aureus] [DB :genpept-bct2] 
[DE : Staphylococcus aureus toxic shock syndrome toxin-1 (tst ), enterotoxin (ent), and 
integrase (int) genes, complete cds.] [NT:orfl3] [LE:9717] [RE:10004] [DI : complement] 



ORF Name 



NTID 



„ w NT AA 

AAID — -i — _ score 

Length Length 



probability 



lA17b03000988 34428905 tl 3 



Description 
NO-HIT 



] |1686 | [5458 |[T55 1 154 | 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



AI7503000988 35428187 c3 37 



] [1687 | [5459 ||438 | |14B | 



Description 
NO-HIT 



ORF Name 



NTID 



^ TT ^ NT AA 

AAID — _ — , score 

LengthLength 



probability 



IAI7503000988 36601678 cl 29 



j [1688 | [5460 ||T72B~||S7g ~| |102 |fu".01Q 



Description 

gp: [GI:gl004289] [LN : PPRNAE14B] [AC:Z50050] [PN:Xylanase B] [OR : Penicillium 
purporogenum] [DB :genpept-plnl] [EC:3.2.1.8] [DE : P . purpurogenum mRNA for 
endo-1 , 4 -beta-xylanase . ] [LE : 11] [RE : 63 7] [DI : direct] 



ORF Name 



NTID 



7V7VTrN NT AA 

AAID — _ — . score 

LengthLength 



probability 



A*7503000988_4303i75_±i_i0 

Description 

NO-HIT 



[1689 | |546i [p5TT~| |5S | 



407 



ORF Name 



NT AA 
NTID AAID — . — A , scor e 
LengthLength 



probability 



[AI7503000988 9923175 Fl 20 



] [1590 | [5452 ||152 1 153 |g [[0.013 



Description 

sp : [LN:NU5M_CAEEL] [AC:P24896] [GN : ND5] [OR : CAENORHABDITIS ELEGANS] [EC:1.6.5.3] 
[DE : NADH- UBIQUINONE OXIDOREDUCTASE CHAIN 5,] [SP:P24896] [DB : swissprot] 
>pir: [LN:S26037] [AC : S26037 : S25810 ] [ PN : NADH dehydrogenase (ubiquinone), chain 5] 
[GN:ND5 ] [CL-.NADH dehydrogenase (ubiquinone) chain 5] [OR : mitochondrion Caenorhabditis 
elegans] [EC:1.6.5.3] [DB:pir2] >gp : [GI :g515886] [LN:MTCE] [AC : X54252 : S93745] 
[OR: Mitochondrion Caenorhabditis elegans] [SR : Caenorhabditis elegans] [DB : genpept-invl] 
[DE:C. elegans complete mitochondrial genome.] [NT :ND5 protein (AA 1 - 527)] 
[SP:P24896] [LE: 11691] [RE:13274] [DI: direct] 



ORF Name 



AI7503000990 10003756 F3 471 



Description 
NO-HIT 



NTID AAID — — , , score 
LengthLength 



■j [1691 [ [ 5463 [[OS— 1| 45 | 



probability 



ORF Name 



AI7503000990 10193763 c2 864 



Description 
NO-HIT 



NTID 



™ TT , NT AA 

AAID — — , score 

LengthLength 



probability 



1692 | [ 5454 | |1B6 ~|| 61 | 



ORF Name 



E 



17503000990 10194713 EI 31 



Description 
NO-HIT 

ORF Name 



XTmTr . NT AA 

NTID AAID — _^ — _ score 

LengthLength 

[1593 | [5455 |p^T"]| 48 | 



probability 



mm 7V7VTT , NT AA 

NTID AAID — — score 

Length Length 



probability 



[AI7503000990 10241287 c2 522 



[1694 | [5456 |[^3~| [ 220 [ | 74 [[UTTH 7 

Description 

gp: [GI:g924349] [LN : HIV1U13473 ] [AC:U13473] [PNienvelope glycoprotein V1V2 region] 
[GNrenv] [OR:Human immunodeficiency virus type 1] [DB :genpept-vrl] [DE:Human 
immunodeficiency virus type 1 isolate 037 clone 03 fromUganda, envelope glycoprotein 
(env) gene, V1V2 region, partial cds . ] [LE:<1] [RE:>285] [DI: direct] 



ORF Name 



^17503000990 10258812 c2 820 



Description 
NO-HIT 



NT AA 

NTID AAID Le ~ thLe ~ th score probability 



ORF Name 



|AI7503000990_10360902_t3__536 



Description 
NO-HIT 



NT AA 

NTID AAID Len ^ tnLe ~ tn score probability 



1695 | [5458 ||183~] p" 



408 



NT AA 

QRF Name ™ L e^thLe^th ^^ probability 



Al7S030009 90jL04a20a _cJ_948 | [ 1697 | |^469 || 1074 1 1 357 | pi> | | 4.4e-6 T 



Description 

sp: [LN:POTD_ECOLI] [AC:P23861] [GN : POTD] [OR : ESCHERICHIA COLI] 

[DE : SPERMIDINE/PUTRES CINE -BINDING PERIPLASMIC PROTEIN PRECURSOR (SPBP) ] [SP:P23861] 
[DB:swissprot] >pir: [LN:D40840] [AC :D40840 : H64856 ] [PN : spermidine/putrescine -binding 
protein precursor : spermidine/putrescine transport protein D] [GNipotD ] [OR: Escherichia 
coli] [DB:pir2] >gp : [GI : dl036929 : gl651550] [LN:D90747] [AC : D90747 : AB001340] 
[PN: Spermidine/putrescine transport protein D] [GN :potD] [OR : Escherichia coli] 
[SR: Escherichia coli (strain : K12 ) DNA, clone:Kohara clone #238] [DB : genpept-bctl] 
[DE: Escherichia coli genomic DNA. (25.2 - 25.6 min).] [NT : ORF_ID : o238#12 ; similar to 
PIR Accession Number] [LE:13470] [RE:14516] [DI : complement ] >gp : [GI : gl787367] 
[LN:AE000212] [AC : AE000212 :U00096] [PN : spermidine/putrescine periplasmic transport] 
[GN:potD] [FN: transport / Transport of small molecules: Amino] [OR Escherichia coli] 
[DB:genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 102 of 400 of the 
completegenome. ] [NT:f348; 100 pet identical to POTD_ECOLI SW: P23861] [LE:7785] 
[RE:8831] [DI : complement ] >gp : [GI :gl47329] [LN: ECOPOTABCD] [AC:M64519] [PN : transport 
protein] [GN :potD] [OR : Escherichia coli] [SR:E.coli (strain DR112) DNA, clone pPT104] 
[DB:genpept-bct2] [DE:E.coli transport protein (potA, potB, potC and potD) 
genes, complete cds . ] [LE:3144] [RE: 4190] [DI: direct] 

NT AA 

QRF Name NTID AAID — , — , _ score probability 

LengtnLengtn 



|ftl7S03000990jL0S7&000_ci_734 | [1698 | [ 5470 |[g57""| |3ifl [ pS - || b . 3e-71> 

Description 

sp: [LN:COXX_BACSU] [AC:P24009] [GN: CTAB] [OR : BACILLUS SUBTILIS] [DE : PROBABLE CYTOCHROME 
C OXIDASE ASSEMBLY FACTOR] [SP:P24009] [DB : swissprot] >pir : [LN : C69609] 
[AC:C69609:S14395 ] [PN : cytochrome caa3 oxidase (assembly factor) ctaB] [GN: ctaB ] 
[CL:heme O synthase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e334763 : g2339991] 
[LN:BS16823KB] [AC:Z98682] [PN:CtaB protein] [GN: ctaB] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE: Bacillus subtilis genomic DNA 23.9kB fragment.] [SP:P24009] 
[LE:4441] [RE:5358] [DI:direct] >gp : [GI :g994794] [LN:BSCTABF] [AC:X54140] 
[PN: cytochrome a assembly facto] [GN: ctaB] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[EC: 1.9. 3.1] [DE :B . subtilis ctaB-F genes for cytochrome a assembly factor 
andcytochrome-c oxidase (EC 1.9.3.1) subunits II, I, II, and IVB.] [NT : putative] 
[SP:P24009] [LE:408] [RE:1325] [DI: direct] >gp : [GI : ell85078 : g2633859] [LN : BSUB0008] 
[AC:Z99111:AL009126] [PN : cytochrome caa3 oxydase assembly factor] [GN : ctaB] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791tO 1603020.] [SP:P24009] [LE:163924] [RE:164841] [DI:direct] 

NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



^7503000S90_i063753_ca_674 | [1595 | [ 5471 |[TT7¥] | 457 | [ 2070 || 3 . 3e-2i4 



Description 

sp: [LN:MURD_STAAU] [AC : 03 3595: 00 7323] [GN:MURD] [OR : STAPHYLOCOCCUS AUREUS] [EC:6.3.2.9] 
[DE : ADDING ENZYME) ] [SP : 033595 : 007323 ] [DB : swissprot] >pir : [LN : JC6560] [AC:JC6560 ] 
[PN : UDP-N-acetylmuramoylalanine- -D-glutamate ligase , 

:UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase] [GN:murD ] [OR : Staphylococcus 
aureus] [EC:6.3.2.9] [DB:pir2] >gp : [GI :g2271510] [LN : AF009671] [AC : AF009671] 

[PN:UDP-N-acetylmuramoyl-L-alanine : D-glutamate] [GN:murD] [FN: catalyzes ATP-dependent 
D-glutamate addition] [OR : Staphylococcus aureus] [DB : genpept-bct2] [DE : Staphylococcus 
aureus UDP-N-acetylmuramoyl-L-alanine : D-glutamateligase (murD) gene, complete cds.] 

[NT :MurD] [LE:1] [RE: 1350] [DI: direct] 



409 



ORF Name 



NT AA 
NT ID AAID ^ — » _ — score 
Length Length 



probability 



IAI7503000990 10740628 El 637 



J |1700 | |5472 [|183 | pT" 



Description 
NO-HIT 



ORF Name 



NT AA 
NT ID AAID _ . , . — 



IAI7503000990 10928 c3 959 



LengthLength probability 
TTTT^ f -4 ^— ||i242 | WT2 |[27T 



l.le-17 



Description 

pir: [LN:T03492] [AC:T03492 ] [PN : hypothetical protein] [OR : Rhodobacter capsulatus] 
[DB:pir2] [MP:1 ] >gp : [GI :g3128293] [LN : AF010496] [AC : AF010496] [PN : hypothetical 
protein] [OR : Rhodobacter capsulatus] [DB : genpept-bct2] [DE : Rhodobacter capsulatus 
strain SB1003, partial genome.] [LE: 54291] [RE: 55613] [DI : complement] 



ORF Name 



NT 



AA 



NT ID AAID — , 



LengthLength 



score 



17503000990 10969052 c3 935 



1702 | |S474 ||T7?T] p7T" 



probability 
1.4e-270 



Description 

sp: [LN:PT1_STAAU] [AC:P51183] [GN:PTSI] [OR : STAPHYLOCOCCUS AUREUS] [EC:2.7.3.9] 
[DE: (PHOSPHOTRANSFERASE SYSTEM, ENZYME I)] [SP:P51183] [DB : swissprot] >gp : [GI : gl070386 ] 
[LN:SAPTSHI] [AC:X93205] [PN : phosphoenolpyruvate -protein phosphatase] [GN:ptsI] 
[OR staphylococcus aureus] [DB : genpept-bctl] [EC: 2. 7. 3. 9] [DE : S . aureus ptsH and ptsl 
genes.] [SP:P51183] [LE:462] [RE:2180] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



[AI7503000990 10972150 c3 1037 



TUFT 



[5475 ||2055 | |S5T" 



probability 
3024 110. 0 



Description 

sp: [LN:RECG_STAAU] [AC:O50581] [GN : RECG] [OR : STAPHYLOCOCCUS AUREUS] [EC:3.6.1.-] 
[DE : ATP -DEPENDENT DNA HELICASE RECG,] [SP: 050581] [DB : swissprot ] 
>gp: [GI:dl025491:g2826896] [LN : AB000439] [AC : AB000439] [PN:RecG] [GN : recG] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus (strain : RN4220) DNA] 
[DB: genpept-bctl] [DE : Staphylococcus aureus recG gene, complete cds . ] [LE:319] 
[RE: 2379] [DIrdirect] 



ORF Name 



NTID 



NT 



AA 



AAID . — . , . 



Length Length 



score 



1175030009 90 1173177 t3 667 



] [1704 | [5476 ||1053||350 |pi" 



probability 
4.4e-68 



Description 

Sp : [LN : YAAF_ECOLI ] [AC:P22564] [GN : YAAF] [OR : ESCHERICHIA COLI] [DE : HYPOTHETICAL 32.6 KD 
PROTEIN IN LYTB-DAPB INTERGENIC REGION] [SP:P22564] [DB : swissprot] >pir: [LN:JE0404] 
[AC: JE0404 :S40553 :F64723 :S22291 ] [PN:probable glycosidase, yaaF] [GN:yaaF ] [CL:yaaF 
protein] [OR : Escherichia coli] [EC:3.2.-.-] [DB:pir2] >gp : [GI : g41934] [LN: ECLSPDAP] 
[AC:X54945] [GN : ORF 3] [OR : Escherichia coli] [DB : genpept-bctl] [DE : E . coli Isp-dapB 
interval.] [NT:product appears to be membrane bound] [SP:P22564] [LE:1613] [RE:2527] 
[DI:direct] >gp: [GI :dl 001780 :g2 16457] [LN:ECO110K] 

[AC:D10483 : J01597 : J01683 : J01706 :K01298 :K01990 :M1042 0 :M10611 :M12544] [OR : Escherichia 
coli] [SR: Escherichia coli (strain : K-12) DNA] [DB : genpept-bctl] [DE:E.coli K12 genome, 
0-2.4min. region.] [NT : hypothetical 32 . 6K protein (PIR : JE0404) ] [LE:26947] [RE:27861] 
[DIrdirect] >gp: [GI :gl786213] [LN : AE000113] [AC : AE000113 :U00096] [PN:orf, hypothetical 
protein] [GN:yaaF] [FN:orf; Unknown] [OR : Escherichia coli] [DB : genpept -bct2 ] 
[DE: Escherichia coli K-12 MG1655 section 3 of 400 of the completegenome . ] [NT:o304; 100 
pet identical to YAAF_ECOLI SW: P22564] [LE:6634] [RE: 7548] [DI: direct] 



410 



ORF Name 



NTID 



NT AA 
AAID — m _ — score 
LengthLength 



probability 



U.7503000990 1178828 c2 810 



1705 | [5477 ||705 | [234 | [STS - ||i . 3e-63 



Description 

gp: [GI :g4097530] [LN : LLU64311] [AC:U6431l] 

[PNrphosphoribosylaminoimidazolesuccinocarboxamide] [GNrpurC] [OR : Lactococcus lactis] 
[DB:genpept-bct2] [EC:6.3.2.6] [DE : Lactococcus lactis 
phosphor ibosylaminoimidazolesuccinocarboxamidesynthetase (purC) , 

phosphoribosylformylglycinamidine synthetase I (purQ) , phosphoribosylf ormylglycinamidine 
synthetase II (purL) , andphosphoribosylpyrophosphate amidotransf erase (purF) genes, 
completecds; and unknown gene.] [NT : SAICAR synthetase] [LE:792] [RE:1502] [DIrdirect] 



ORF Name 



NTID 



____ NT AA 

AAID — _ — .score 

LengthLength 



probability 



IAI7503000990 1179542 c2 839 



TTuT" 



[5478 ||222 | 



Description 
NO-HIT 



ORF Name 



NT AA 
NTID AAID — — score 
LengthLength 



probability 



AI7503000990 11806512 t3 562 



[1707 | [ 5479 | pTT7 — 1 1 6 B | p || 0.00E>0 — | 

Description 

gp: [GI:el344613 :g3874900] [LN:CEC41G6] [AC:Z81047] [GN:C41G6.8] [OR : Caenorhabditis 
elegans] [DB : genpept - invl ] [DE : Caenorhabditis elegans cosmid C41G6, complete sequence.] 
[LE : 18 951: 19702 : 20184] [RE : 19549 : 20043 : 2 0463] [DI : direct Join] 



NT AA 

ORF Name NTID AAID — — , , scor e probability 
LengthLength 



[AI7503000990_1204567_c2_897 | [ 1708 | [ 5480 | [7?u~[ [ 249 | pi~|| 8 . Oe-46 

Description 

pir: [LN:G69878] [AC:G69878 ] [PN : conserved hypothetical protein yloO] [GN:yloO ] 
[CL: conserved hypothetical protein yloO: conserved hypothetical protein yloO homology] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85167 :g2633948] [LN : BSUB0009] 
[AC:Z99112 :AL009126] [GN:yloO] [FN : unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:similar to hypothetical proteins] [LE:51363] [RE:52127] [DI:direct] 

>gp: [GI:e323505:g2337805] [LN : BSY13 937] [AC:Y13937] [PN:putative Ptcl protein] 
[GN:yloO] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis 

genomic DNA from the spoVM region.] [LE: 13135] [RE: 13899] [DI: direct] 

raw— «^ -»T^T.II — ^11— — _ - ... ■„..— ■.,„■ — i— n — i. ^— ,, iiii m i ij iii JM .I ^ — i n l | f .ii i i — I. I ,_ , . . ■ r- ., „„ ,,. .. . „„, . „ . .,,,. „ . _, . .„ ,_ „„ . ,.„„„ ._... „ .. . .. „ .. _ rii ||[|[i 

NT AA 

ORF Name NTID AAID — — , , scor e probability 

LengthLength J - 



A:f7503000990_i209£38_ci_775 | [1709 | [ 5481 |[5*P~| [ 179 | p2~ || 6 . 9e-63 



Description 

Sp : [LN : LSPA_STAAU] [AC:P31024] [GN : LSPA: LSP] [OR : STAPHYLOCOCCUS AUREUS] [EC : 3 . 4 . 23 . 36 ] 
[DE: PEPTIDASE) (SIGNAL PEPTIDASE II) (SPASE II)] [SP:P31024] [DB : swissprot] 
>pir: [LN:S20433] [AC:S20433 ] [PN : lipoprotein signal peptidase,] [CL : lipoprotein signal 
peptidase] [OR: Staphylococcus aureus] [EC : 3 . 4 . 23 . 36] [DB:pir2] >gp : [GI : gl53045] 
[LN : STALSP] [AC : M83994 : M84707] [PN : prolipoprotein signal peptidase] [GN:lsp] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus DNA] [DB : genpept -bet 1] 
[DE: Staphylococcus aureus prolipoprotein signal peptidase (lsp) gene , complete cds . ] 
[LE:1213] [RE: 1704] [DI: direct] 



411 



ORF Name 

1A17503000990 120%8 2 c2 855 



NT AA 
NTID AAID — , _ — , , score 
LengthLength 



probability 



[1710 | [5482 ||2409 ||802 | p3T]|4 . 3e-205 



Description 

sp: [LN:SYFB_BACSU] [AC : P17922 : P9454 0] [GN : PHET] [OR : BACILLUS SUBTILIS] [EC : 6 . 1 . 1 .20] 
[DE : TRNA LIGASE BETA CHAIN) (PHERS) ] [SP : P17922 : P9454 0] [DB : swissprot] >pir: [LNrYFBSB] 
[AC :A69676: 140460 :S11731 ] [PN : phenylalanine- -tRNA ligase, beta chain : phenyl alanyl-tRNA 
synthetase beta chain] [GNrpheT ] [CL phenylalanine --tRNA ligase beta chain] 
[OR:Bacillus subtilis] [EC : 6 . 1 . 1 . 20] [DB:pirl] >gp : [GI : ell84112 :g2635328] [LN: BSUB0015] 
[AC:Z99118:AL009126] [PN : phenyl alanyl -tRNA synthetase (beta subunit) ] [GN :pheT] 
[ORrBacillus subtilis] [DB :genpept-bctl] [EC : 6 . 1 . 1 . 20] [DE:Bacillus subtilis complete 
genome (section 15 of 21): from 2795131to 3013540.] [SP:P17922] [LE:130980] [RE:133394] 
[DI: complement] >gp : [GI:ell65325:gl 770031] [LN : BSZ75208] [AC:Z75208] 
[PN:phenylalanyl-tRNA synthetase beta subunit] [GN : pheT] [ORrBacillus subtilis] 
[DB:genpept-bctl] [EC : 6 . 1 . 1 . 20] [DE : B . subtilis genomic sequence 89009bp.] 
[NT:phenylalanyl-tRNA synthetase beta subunit] [SP:P17922] [LE: 36513] [RE: 38927] 
[DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



E 



17503000550 13720312 c3 1024 



T7TT" 



[5483 |p?u~[|75 |pTT 



probability 
3.0e-07 



Description 

pir: [LN:C69878] [AC:C69878 ] [PN : hypothetical protein yloH] [GN:yloH ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell85160 : g2633941] [LN : BSUB0009] [AC : Z99112 : AL009126] 
[GN: yloH] [FN: unknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 9 of 21) : from 1598421to 1807200.] [LE:43546] [RE:43749] 
[DI:direct] >gp : [GI : e323522 :g2337798] [LN :BSY13937] [AC:Y13937] [PN:putative rpoZ 
protein] [GNryloH] [FNrunknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus 
subtilis genomic DNA from the spoVM region.] [LE:5318] [RE: 5521] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID , , 

LengthLength 



score probability 



|AI7503000990_i3757785_r3_639 
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Description 
NO-HIT 



ORF Name 



NTID 



^ m NT AA 

AAID — -| — , , score 

LengthLength 



IAI7503000990 1376317 c2 881 



"| [1713 | [5485 |[E^~[|187 |pT" 



probability 
8.0e-46 



Description 

gp: [GI:el99384:gl514599] [LN : LPPYRBSOP] [AC:Z54240] [GN:pyrR] [FN : regulatory protein] 
[OR Lactobacillus plantarum] [DB :genpept-bctl] [DE : L . plantarum pyrimidine biosynthetic 

operon (pyrR, pyrB, pyrC,pyrAA, pyrAB, pyrD, pyrF, and pyrE) genes.] [LE:781] [RE:1323] 
[DI : direct] 



ORF Name 



NTID 



NT 

AAID — 



AA 



score probability 



AI7503000590_13797076_c3_9i5 

Description 

NO-HIT 



1714 
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ORF Name 



NTID 
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NT AA 
AAID — , — . . score 
Length Length • 



probability 
"] [1715 | [5487 |pi~ ||77 | p5~ ] |1 . 4e-14 



Description 

pir: [LN:D69871] [AC:D69871 ] [PN : hypothetical protein ykzG] [GN:ykzG ] [OR .-Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell85044 : g2633825] [LN :BSUB0008 J [AC : Z99111 : AL009126] 
[GNrykzG] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 8 of 21): from 1394791to 1603020.] [LE:129406] [RE:129615] 
[DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



. — « — _ score probability 
LengthLength c 



IAI7503000990 13859091 £2 433 



] [1716 | [5488 |pi~||97 ||245 ||5.4e-21 



Description 

sp: [LN:QOX4_BACSU] [AC:P34959] [GN : QOXD : IPA-40D] [OR:BACILLUS SUBTILIS] [EC:1.9.3.-] 
[DE:SUBUNIT QOXD)] [SP:P34959] [DB : swissprot] >pir : [LN:D38129] [AC : D3812 9 : S3 9695 : H69687 
] [PN: quinol oxidase aa3-600 chain IV qoxD : cytochrome aa3 quinol oxidase (subunit IV) 
qoxD] [GN: qoxD ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :gl43399] [LN:BACQOXA] 
[AC:M86548] [PN:quinol oxidase] [GN:QOXD] [OR:Bacillus subtilis] [SR:Bacillus subtilis 
DNA] [DB:genpept-bctl] [DE:Bacillus subtilis AA3-600 quinol oxidase (QOXA, QOXB, QOXC, 
QOXD)genes, complete cds . ] [LE:4425] [RE:4799] [DI .-direct] >gp : [GI : g413964] [LN: BSGENR] 
[AC:X73124] [GN:ipa-40d qoxD] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE :B. subtilis 
genomic region (325 to 333).] [SP:P34959] [LE:42877] [RE:43251] [DI:direct] 
>gp: [GI:ell86313:g2636349] [LN : BSUB0020] [AC : Z99123 : AL009126] [PN : cytochrome aa3 quinol 
oxidase (subunit IV)] [GN: qoxD] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate 
gene name: ipa-40d] [SP:P34959] [LE:114711] [RE:115085] [DI : complement] 



ORF Name 



NTID 



™™ NT AA 

AAID — — , , score 

LengthLength 



probability 



E 



17503000990 13915017 ci 738 



Description 
NO-HIT 
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IAI7503000990 14450882 c2 840 



NT AA 
NTID AAID — ^ — , , score 
LengthLength 



probability 



[1718 | [5490 |p^u~[[79 ||321 ||7.2e-29 



Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT : ORF1] 

[LE : 1101] [RE : 1922] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — _ T — , , score 
LengthLength 



probability 



AI7503000990_i4SS7882_i3_S32 

Description 

NO-HIT 



"J |1719 | |5491 |[TT5 [ |41 | 
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NT AA 

QRF Name NTID AMD Le — thLe — th score probability 



AI7503000990_i464213b_c3_i)57 | | i720 | [ 5492 | | 190b 1 1634 | [ 2455 || 4 . le-255 



Description 

Sp : [LN : TYPA_BACSU] [AC:O07631] [GN: TYPA] [OR : BACILLUS SUBTILIS] [DE : GTP- BINDING PROTEIN 
TYPA/BIPA HOMOLOG] [SP:O07631] [DB : swissprot] >pir : [LN: E69872] [AC:E69872 ] 
[PN:GTP-binding elongation factor homolog ylaG] [GNiylaG ] [CL : translation elongation 
factor Tu homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85067 :g2633848] 
[LN:BSUB0008] [AC : Z99111 : AL009126 ] [GNiylaG] [FN:unknown] [ORiBacillus subtilis] 
[DB .-genpept-bctl] [DE:Bacillus subtilis complete genome (section 8 of 21) : from 
1394791to 1603020.] [NTisimilar to GTP-binding elongation factor] [SP:O07631] 
[LE:150736] [RE: 152574] [DI:direct] >gp : [GI :ell91893:g2 224766] [LN : BSZ97025] 
[AC:Z97025] [GN: ylaG] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
nprE, yla [A, B, C, D, E, F, G, H, I,J,K,L,M,N,0] and pycAgenes . ] [NT:product highly similar to 
elongation factor EF-G] [SP:O07631] [LE:4995] [RE: 6833] [DI : direct] 

NT AA 

QRF Name NT ID AAID — _ , — , , score proba bility 
LenqthLength c L - 



|AI7S03000&90J.4644066_c2_56S | [1721 | |S493 |p^~|p ||7S ||0.0032 



Description 

gp: [GI:g3212079] [LN : AF068633 ] [AC : AF068633 ] [PN:phenol soluble modulin beta 1] 
[FN: inflammatory protein] [OR : Staphylococcus epidermidis] [DB :genpept-bct2] 
[DE : Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds . ] [NT:PSM beta 1] [LE:669] [RE: 803] [DI: direct] 

>gp: [GI:g3212080] [LN : AF068633] [AC : AF068633] [PN:phenol soluble modulin beta 2] 

[FN: inflammatory protein] [OR : Staphylococcus epidermidis] [DB :genpept-bct2] 

[DE: Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds.] [NT:PSM beta 2] [LE:859] [RE:993] [DIrdirect] 

NT AA 

QRF Name NTID AAID — — , , score probability 
LengthLength ^ 

[A!T7503000990_14650302_cij588 | [1722 | |b494 |87 || 179 || 8.0e-14 [ 

Description 

Sp : [LN : YEXA_BACSU] [AC:P12049] [GN : YEXA] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 9.7 KD 
PROTEIN IN PURC-PURL INTERGENIC REGION] [SP:P12049] [DB : swissprot] >pir: [LN:E29326] 
[AC:E29326 :E69799 ] [PN : conserved hypothetical protein yexA: hypothetical protein (pur 
operon) ] [GN:yexA ] [CL : conserved hypothetical protein MJ1593] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : el 182 626 :g2 632960] [LN: BSUB0004] [AC : Z 99107 : AL009126] [GN :yexA] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 4 of 21): from 600701 to813890.] [NT:similar to hypothetical proteins] 
[LE : 101124] [RE : 101378] [DI : direct] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000990_i46g40i2_t2_323 | [1723 | |549b || 4g8 ^ |155 | pTT~ | |5 . 4e-3i 



Description 

pir: [LN:E69875] [AC:E69875 ] [PN : hypothetical protein ylbP] [GN:ylbP ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : e334785 :g2340013] [LN : BS 16 8 2 3KB] [AC:Z98682] [PN:YlbP 
protein] [GN:ylbP] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
genomic DNA 23.9kB fragment.] [LE: 21896] [RE: 22378] [DI : complement] 

>gp: [GI:ell85100:g2633881] [LN: BSUB0008] [AC : Z99111 : AL009126] [GN:ylbP] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [LE:181379] [RE:18186l] [DI : complement] 



414 



ORF Name 


NT AA 
NT ID AAID _ — . , _ — . , score 
Leng t n Lteng lii 


probability 


AI7503000990 148blb8/_CJ 1010 


j |1724 |5496 |126 | |41 j 




Description 






ORF Name 


NT AA 
NT ID AAID , — . , . — , , score 

J-icIig LllJ-icilly LII 


probability 


u\l 750300099 0_14 8750 0 2_t 2_4 0 8 


1725 15497 141 1146 1185 


10 . 0044 


Description 

gp: [GI :g488889] [LN:A12521] [AC:A12521] [PN:Acidic Basic Repeat Antigen Rhoptry (ABRA) ] 
[OR: Plasmodium falciparum] [SR: malaria parasite P. falciparum] [DB :genpept-pat] 
[DE:Agl89 clone.] [LE:1:61] [RE: 45: 963] [DI : direct Join] 


ORF Name 


NT AA 
NTID AAID „ — , , m — . , score 
■ - Length Length 


probability 


AI7503000990_156502_£i_i79 


| |1726 | |5498 | 243 | |S0 | 




Description 
NO-HIT 






Ann >t -~ _ 

ORF Name 


■fc-rm-ri-N n -r NT AA 

NTID AAID _ — . , „ — , score 
Length Length 


probability 


AI7S03000990_i6S93800_i2_359 


1727 | |5499 144 1 147 | 




Description 
NO-HIT 






ORF Name 


XTm-i-fk t\ t\ t" r\ NT AA 

NTID AAID , — . , _ — . , score 
Length Length 


probability 


^I7503000990_19S37562_r2_429 


| |1728 | |BB00 ||198 1 165 | 




Description 
NO-HIT 






ORF Name 


NT AA 
NTID AAID _ — . , , — . , score 
LeiigLh Length 


probability 


A17b03000990_19537878_tl_177 


| |1729 | |550i | 159 1 152 | 




Description 
NO-HIT 






ORF Name 


NTID AAID _ — , , — , score 
Length Length 


probability 


AI7503000990_i955S32S_ti_94 


| |1730 | |5502 ||14i ||4£ | 




Description 







NO-HIT 
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ORF Name 



NT ID 



NT AA 
AAID — _ T — , , score 
Length Length 



probability 



IAI7503 



000990 19557752 cl 777 



] [ 1731 | [ 5503 || 912 ||303 | . 7 e-84 



Description 

sp: [LN:PYRB_BACSU] [AC:P05654] [GN : PYRB] [OR:BACILLUS SUBTILIS] [EC:2.1.3.2] 
[DE: TRANS CARB AMYLASE) (ATCASE) ] [SP:P05654] [DB : swissprot ] >pir: [LN : OWBSAC] 
[AC:A25015 :C39845 :B69686 ] [PN : aspartate carbamoyl transf erase, catalytic 
chain: aspartate transcarbamoylase] [GN:pyrB ] [CL : ornithine carbamoyl transf erase : 
aspartate/ornithine carbamoyl transf erase homology] [OR: Bacillus subtilis] [EC: 2. 1.3. 2] 
[DBrpirl] [MP:37 min ] >gp : [GI :gl43384] [LN : B AC PYRB] [AC:M13128] [OR:Bacillus subtilis] 
[SR:B. subtilis (strain JH861) DNA, clone pLS201] [DB : genpept-bctl] [DE : B . subtilis pyrB 
gene encoding aspartate transcarbamoylase, completecds . ] [NT : aspartate 
transcarbamoylase (EC 2.1.3.2)] [LE:96] [RE: 1010] [DI: direct] 
>gp: [GI:ell85141:g2633922] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN : aspartate 
carbamoyl transf erase] [GN:pyrB] [ FN : pyr imidine biosynthesis] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [EC:2.1.3.2] [DE:Bacillus subtilis complete genome (section 9 of 21): 
from 1598421to 1807200.] [SP:P05654] [LE:21455] [RE:22369] [DI:direct] >gp : [GI :gl43387] 
[LN : BACPYROP] [AC:M59757] [PN : aspartate transcarbamylase] [GN:pyrB] [OR:Bacillus 
subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis ORF1A (pyrR) , putative membrane -bound 
uracilpermease (pyrP) , aspartate transcarbamylase (pyrB) , dihydroorotase (pyrC) , 
glutaminase of carbamyl phosphate synthetase (pyrAA) , carbamyl phosphate synthetase 
(pyrAB) , dihydroorotase dehydrogenase (pyrD) , OMP decarboxylase (pyrF) , and OMP-PRPP 
transferase (pyrE)genes, complete cds; and unknovm gene.] [LE:2859] [RE:3773] 
[DI :direct] 



ORF Name 



NTID 



7V7VTr , NT AA 

AAID — , — .score 

LengthLength 



probability 



|AI7503000990_I9555627_c2_879 



] [1732 | |bb04 |p^~ ||43 | 



Description 
NO-HIT 

ORF Name 



NTID 



MT „ NT AA 

AAID — _ score 

LengthLength 



probability 



^17503000990 19609530 ti 155 



Description 
NO-HIT 



[1733 | [5505 |pH~||48 | 



ORF Name 



NTID AAID 



NT 



AA 



Length Length 



score probability 



IA17503000990 19804703 c2 811 



] I 1734 1 l 1 ^ 06 |p47~| p48 ||696 ||1.3e-6B 



Description 

sp: [LN:PURQ_BACSU] [AC:P12041] [GN : PURQ] [OR:BACILLUS SUBTILIS] [EC:6.3.5.3] 
[DE : SYNTHASE I)] [SP:P12041] [DB : swissprot] >pir : [LN : SYBS1G] [AC : F29326 : H6 9684 ] 
[PN:phosphoribosylformylglycinamidine synthase, component I] [GN:purQ ] 
[CL: phosphor ibosylf ormylglycinamidine synthase component I] [OR: Bacillus subtilis] 
[EC:6.3.5.3] [DB:pirl] [MP:18 rain ] >gp : [GI :gl43368] [LN : BACPURF] [AC : J02732 : K00047] 
[OR:Bacillus subtilis] [SR:B. subtilis (strain DEI (prototroph DER. or W168) ) DNA, clone 
pPZ] [DB:genpept-bctl] [DE : B . subtilis pur operon encoding purine biosynthesis enzymes, 
12genes.] [NT : phosphor ibosylf ormyl glycinamidine synthetase I] [LE:4393] [RE: 5076] 
[DI: direct] >gp: [GI:ell82627:g2 632961] [LN : BSUB0004] [AC : Z99107 : AL009126] 
[PN: phosphor ibosylf ormylglycinamidine synthetase II] [GN :purL] [FN:purine biosynthesis] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [EC:6.3.5.3] [DE:Bacillus subtilis complete 
genome (section 4 of 21): from 600701 to813890.] [SP:P12041] [LE:101375] [RE:102058] 
[DI: direct] 



416 



ORF Name 



NT AA 
NT ID AAID — T — , , score 
LengthLength 



probability 



|A1 7 b03000990 199b79 02 el 752 



J [1735 | [ 5507 ||IB9g-]|63i | p5T]| 9 . be-288 



] 



Description 

gp: [GI:el333202:g3776112] [LN:SATRXA] [AC : AJ223480] [PN : excinuclease ABC, subunit C] 
[GNiuvrC] [FNrexcision of ultraviolet light -induced] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE : Staphylococcus aureus trxA and uvrC genes and partial mutS and 

dhsCgenes.] [NT : alternative gene name: uvrB] [LE:2972] [RE:4753] [DI: direct] 



ORF Name 



NT ID 



^ m NT AA 

AAID — , — , . score 

LengthLength 



probability 



|A:t7g03000990_203i2BiS_t2_242 



TTTZ~ 



141 I 155" 



Description 
NO-HIT 



ORF Name 



>TmTn NT AA 

NTID AAID — _ — _ score 

LengthLength 



probability 



U.7b03000990 20500055 tl 218 



Description 
NO-HIT 



j [1737 | [ 5509 || 132 j | 4i | 



ORF Name 



NTID 



AI7503000990_205022i7_c3_990 



„ TO NT AA 
AAID — . T — ^ score 
LengthLength 



[1738 | [5510 ||i625"||54i | [TUT 



probability 
|6.1e-7i 



Description 

pir: [LN:G69875] [AC:G69875 ] [PN : hypothetical protein yllA] [GN : yllA ] [ORrBacillus 
subtilis] [DB:pir2] >gp: [GI : ell85102 :g2633883] [LN : BSUB0008] [AC : Z99111 : AL009126] 

[GN : yllA] [FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis 
complete genome (section 8 of 21): from 1394791to 1603020.] [LE:182988] [RE:184607] 

[DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000990 20S7S27 c2 849 



Description 
NO-HIT 



] [1739 | [5511 ||129 | [42 | 



ORF Name 



NT AA 

^ AAID Le ~ thLe - th score probability 



[AX7503000990_2089087b_c2J301 



Description 
NO-HIT 



] [1740 | [5512 j[T¥7 | [48 [ 



ORF Name 



NT AA 

^ MID Le - thLe - th gcore probability 



| &I75030005tfO_205840_tl_44 



Description 
NO-HIT 



j [1741 | [5513 ||T55~~| p£ 
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ORF Name 



NT 

NTID AAID . — 



AA 



LengthLength 



score probability 



E 



17503000990 20994212 tl 151 



] [1742 | [5514 |pT-||68 | [71 ||0.022 



Description 

pir: [LN:D69872] [AC:D69872 ] [PN : hypothetical protein ylaF] [GNrylaF ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell85066 :g2633847] [LN: BSUB0008] [AC : Z99111 : AL009126] 
[GNrylaF] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 8 of 21): from 1394791to 1603020.] [LE:150435] [RE:150623] 
[DI: complement] >gp : [GI : e324391 :g2224765] [LN : BSZ97025] [AC:Z97025] [GN: ylaF] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis nprE, 

yla [A, B, C, D, E, F,G,H,I,J,K,Ij r M,N,0] and pycAgenes . ] [LE:4694] [RE:4882] [DI : complement] 



ORF Name 



^17503000990 211677 c3 989 



NT 



AA 



NTID AAID — UT — _ 
LengthLength 



score 



T7TT 



[5515 |[702 ||233 



probability 
|4.7e-4i 



Description 

pir: [LN:E6 9814] [AC:E69814 ] [PN : conserved hypothetical protein yfnB] [GN:yfnB ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : el!82712 :g2633046] [LN : BSUB0004] 
[AC:Z99107:AL009126] [GN:yfnB] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 4 of 21): from 600701 to813890.] 
[NT: similar to hypothetical proteins] [LE: 203465] [RE: 204172] [DI : complement] 
>gp: [GI:ell82723:g2633057] [LN : BSUB0005] [AC : Z99108 : AL009126] [GN:yfnB] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
5 of 21): from 802821 tol011250.] [NT:similar to hypothetical proteins] [LE:1345] 
[RE:2052] [DI : complement ] >gp : [GI : dl020926 : g2116 760] [LN:D86418] [AC:D86418] [PN: YfnB] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain :AC327) DNA] [DB : genpept-bctl] 
[DE:Bacillus subtilis genomic DNA 69-70 degree region, partialsequence . ] [LE:9019] 
[RE: 9726] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



T — i_i_ T — ■ * score probabilit y 
LengthLength c *- 



E 



I7S03000990 2164S962 c3 930 



] [1744 | [5515 |pTT|p8 | [1793 ||7 . 4e-185 



] 



Description 

gp: [GI:g517205] [LN: SPU09352] [AC:U09352] [PN:67 kDa Myosin -cross reactive 
streptococcal] [OR : Streptococcus pyogenes] [DB : genpept-bctl] [DE : Streptococcus pyogenes 
42 KD protein (ORF1) gene and 67 KDMyosin-crossreactive streptococcal antigen gene, 
complete cds.] [NT : ORF2] [LE:1734] [RE: 3506] [DI:direct] 



ORF Name 



NTID 



„ TO NT AA 
AAID — — score 
LengthLength 



IAI7503000990 21656327 cl 761 



11745 



5517 IISTS 



probability 
| |1464 ||FTTe-150 



Description 

gp: [GI:g2149891] [LN: SAU94706] [AC:U94706] [PN:unknown] [GN:yllC] [OR : Staphylococcus 
aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus strain ATCC 8325-4 cell wall/cell 
divisiongene cluster, yllB, yllC, yllD, pbpA, mraY, murD, divlB, ftsA andftsZ genes, 
complete cds.] [LE:987] [RE: 1922] [DI:direct] 



418 



ORF Name 



NTID 



NT AA 
A AID — • — , , score 
LenqthLength 



probability 



U7503000990 2I7b9fcbJ t3 566 



1745 | [5518 ||509 | [202 |[552—||2 ,4e-53 



] 



Description 

sp: [LN:QOX3_BACSU] [AC:P34958] [GN :QOXC : IPA-39D] [ORrBACILLUS SUBTILIS] [ECrl.9.3.-] 
[DE:SUBUNIT QOXC) ] [SPrP34958] [DB : swissprot] >pir : [LN : C38129] [AC : C38129 : S39694 :G69687 
] [PNrbo-type ubiquinol oxidase, chain III qoxC : cytochrome aa3 quinol oxidase (subunit 
III) qoxC] [GN:qoxC ] [CL : cytochrome -c oxidase chain III] [OR:Bacillus subtilis] 
[EC:1.10.3.-] [DB:pir2] >gp : [GI : g!43398] [LN:BACQOXA] [AC:M86548] [PN: quinol oxidase] 
[GNrQOXC] [ORrBacillus subtilis] [SRrBacillus subtilis DNA] [DBrgenpept-bctl] 
[DE:Bacillus subtilis AA3-600 quinol oxidase (QOXA, QOXB, QOXC, QOXD)genes, complete 
cds.] [LE:3809] [RE:4423] [DIrdirect] >gp : [GI rg413963] [LN: BSGENR] [AC:X73124] 
[GNripa-39d qoxC] [ORrBacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis genomic 
region (325 to 333).] [SP:P34958] [LE:42261] [RE:42875] [DI: direct] 

>gp: [GI:ell86314:g2636350] [LN : BSUB0020] [AC : Z99123 : AL009126] [PN : cytochrome aa3 quinol 
oxidase (subunit III)] [GN:qoxC] [ORrBacillus subtilis] [DB : genpept-bctl] [DErBacillus 
subtilis complete genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate 
gene name: ipa-39d] [SP:P34958] [LE:115087] [RE: 115701] [D I : complement] 



ORF Name 



NTID 



IA17503000990 21915911 ii 197 



][ 



T7T7" 



NT AA 

AAID 

LengthLength 

[5519 ||894 | [257 



score probability 
1229 ||4.0e-19 



Description 

pir: [LN:G69858] [AC:G69858 ] [PN : hypothetical protein ykoC] [GNrykoC ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI : ell81521 rg2632041] [LN : BSAJ2571] [AC r AJ002571] [PNrYkoC] 
[GNrykoC] [FNrunknown] [ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis 
168 56 kb DNA fragment between xlyA and ykoR.] [LEr 40195] [RE : 40959] [DI r complement] 
>gpr [GIrell83341rg2633675] [LN r BSUB0007] [AC r Z99110 r AL009126] [GNrykoC] [FNrunknown] 
[ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis complete genome (section 
7 of 21) r from 1194391to 1411140.] [LErl93144] [RErl93908] [D I r complement] 



ORF Name 



NTID 



AAID „ — 



|&I7503000990_22000943_ti_21g 

Description 

NO-HIT 



NT AA 
LengthLength 



score 



probability 



7T 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



|AI7b03000990 22400283 c2 830 



[5521 ||981~||325 |p?T 



probability 
i.ie-135 



Description 

pirr [LNrC36718] [AC r C36718 r A69674 ] [PNrpyruvate dehydrogenase (lipoamide) , El beta 
chain precursor pdhB] [GNrpdhB ] [CLrpyruvate dehydrogenase (lipoamide) beta chain] 
[ORrBacillus subtilis] [ECrl.2.4.1] [DBrpir2] >gp r [GI : gl43378] [LN r BACPYDHY] 
[ACrM57435 rM31542] [PNrpyruvate decarboxylase (E-l) beta subunit] [GNrpdhB] 
[ORrBacillus subtilis] [SR r B . subtilis (strain 168) BRB1 (sacA321 metB5) cell line DNA, 
clone] [DBrgenpept-bctl] [EC: 1.2. 4.1] [DE rB . subtilis pyruvate dehydrogenase complex 
genes, complete cds ; PAL-related lipoprotein (sip) gene, complete cds, 
lysinedecarboxylase (cad) gene, partial cds.] [LEr 2796] [RE r 3773] [DIrdirect] 
>gp: [GIrell85049rg2633830] [LN : BSUB0008] [AC : Z99111 : AL009126] [PNrpyruvate 
dehydrogenase (El beta subunit)] [GNrpdhB] [ORrBacillus subtilis] [DBrgenpept-bctl] 
[ECrl.2.4.1] [DErBacillus subtilis complete genome (section 8 of 21): from 1394791to 
1603020.] [SP:P21882] [LE:134060] [RE:135037] [DI:direct] >gp : [GI : g3282143] 
[LN:AF012285] [AC : AF012285 : AF012284 : U51911] [PN:pyruvate decarboxylase E-l beta 
subunit] [GN:pdhB] [OR:Bacillus subtilis] [DB :genpept-bct2] [EC:1.2.4.1] [DE:Bacillus 
subtilis mobA-nprE gene region.] [NT:similar to pyruvate decarboxylase (E-l) beta] 
[LE:34548] [RE:35525] [Dlidirect] 



419 



NT AA 

QRF Name MTID AAID Le ^thLe^th ^^ probability 

|AI7503000990J>2429Ja_c3_1009 | [ 1 7 50 | [ 5522 || 933 |piO | |TPSS"| |1 ,2e-10fe | 

Description 

sp: [LN:YLYB_BACSU] [AC : Q45480 : 031732] [GNrYLYB] [0R:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 
33.7 KD PROTEIN IN LSP-PYRR INTERGENIC REGION (ORF-X) ] [SP : Q45480 : 031732] 
[DB:swissprot] >gp : [GI :gl373157] [LN: BSU48870] [AC:U48870] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis signal peptidase II (lsp) gene, complete 
cds, isoleucyl-tRNA synthetase (ileS) and pyrR genes, partial cds.] [NT:orf-X; 
hypothetical protein; Method: conceptual] [LE:1658] [RE: 2569] [DI : direct] 

NT AA 

QRF Name NT ID AAID , — . — , score probability 
LengthLength 

A17B03000990_2246i07fl_ci_769 | [1751 | [5523 |[7T5~| [ 244 j |gI3~| |4 . ie-8i 



Description 

gp: [GI:gl314301] [LN: SAU41072] [AC:U41072] [PN:unknown] [GN : ORF] [OR : Staphylococcus 
aureus] [DB : genpept-bct2] [DE : Staphylococcus aureus isoleucyl-tRNA synthetase (ileS) 
gene, partial cds.] [LE:106] [RE: 723] [DI: direct] 

NT AA 

ORF Name NT ID AAID — — _ score probability 
LengthLength c ^ 



[Al7503000990_224fe2B02_c3_1026 | [1752 | [ 5524 |pPT] | 4 0 0 | [ 1070 ||3 . ie- 10« 

Description 

pir: [LN:D69878] [AC:D69878 ] [PN : pantothenate metabolism flavoprotein homolog ylol] 
[GN:yloI ] [CL: pantothenate metabolism flavoprotein dfp] [OR:Bacillus subtilis] 
[DB:pir2] >gp : [GI : ell85161 :g2633942] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:yloI] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 9 of 21): from 1598421to 1807200.] [NT:similar to pantothenate 

metabolism flavoprotein] [LE:43830] [RE:45050] [DI:direct] >gp : [GI : e323501 :g2337799] 
[LN:BSY13937] [AC:Y13937] [PN:putative Dfp protein] [GNryloI] [FN:unknown] [OR:Bacillus 

subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis genomic DNA from the spoVM region.] 
[LE:5602] [RE:6822] [DI:direct] 

NT AA 

ORF Name NTID AAID „ — — ^ score probability 

LengthLength c L 



[AI7503000990_22478427_c3_987 | [1753 | [ 5525 |p3— 1| 45 | [TTB — 1| 4 . Ue-07 

Description 

sp: [LN:GGI2_STAHA] [AC:P11698] [OR : STAPHYLOCOCCUS HAEMOL YT I CUS ] [DE : ANTIBACTERIAL 
PROTEIN 2 (GONOCOCCAL GROWTH INHIBITOR 2)] [SP:P11698] [DB : swissprot] >pir : [LN : BXSA2] 
[AC:S00600 ] [PN: antibacterial protein 2 gonococcal growth inhibitor 2] 

[CL: Staphylococcus haemolyticus antibacterial protein] [OR : Staphylococcus haemolyticus] 
[DB:pirl] 

NT AA 

ORF Name NTID AAID — — ; , score probabil ity 
L en gth Length 



A:£7B03000990_22B42Bg7_c2_fl66 | [1754 | | B526 |[7?9~ | | 252 | [5g2~| |2 . ie-54 



Description 

pir: [LN:G69984] [AC:G69984 ] [ PN : rRNA methylase homolog ysgA] [GN : ysgA ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell84114:g2 635330] [LN : BSUB0015] [AC : Z99118 : AL009126] 
[GN:ysgA] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 15 of 21): from 2795131to 3013540.] [NT:similar to rRNA 
methylase] [LE:134799] [RE:135545] [DI : complement] >gp : [GI : ell65323 :gl770029] 
[LN:BSZ75208] [AC:Z75208] [PN : hypothetical protein] [GN : ysgA] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic sequence 89009bp.] [NT:Homology to SpoU-type 
rRNA methyltransferases;] [LE: 34362] [RE: 35108] [DI: direct] 



420 



NT AA 

ORFName NTID AAID Le ~ thLe ~ th £2Q£e probability 

[Al7B03000990_22VVbl^fc_t2_411 | |1755 | [ 5527 [pi~||210 | pfej || 1.0e-22 — [ 

Description 

pir: [LN:A69859] [AC:A69859 ] [PN : hypothetical protein ykoE] [GN:ykoE ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell81523 :g2632043] [LN : BSAJ2 571] [AC : AJ002571] [PN : YkoE] 
[GN:ykoE] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept - be t 1 ] [DE:Bacillus subtilis 
168 56 kb DNA fragment between xlyA and ykoR.] [LE:42565] [RE:43164] [DI : complement] 
>gp: [GI:ell83343:g2633677] [LN : BSUB0007] [AC : Z99110 : AL009126] [GNiykoE] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [LE:195514] [RE:196113] [D I : complement] 



NT AA 

ORF Name NTID AAID Le ^ t ^ Le ^ th score probability 



|A17503000990_22931g42_c2_802 | |175£ | [ 5528 || i22 ¥] [ 407 | [ 1250 || 2 .5e-127 — | 

Description 

gp: [GI:dl024918 :g2696796] [LN : AB00963 5] [AC : ABO 0963 5] [PN:Fmt] [GN:fmt] 
[OR: Staphylococcus aureus] [SR : Staphylococcus aureus (strain : KSA8 ) DNA] 
[DB:genpept-bctl] [DE : Staphylococcus aureus DNA for Fmt, complete cds . ] [LE:1234] 
[RE : 2427] [DI :direct] 

NT AA 

ORF Name NTID AAID — - — , score probability 
LengthLength 

|AI750300u990_23438887_cl_748 | [1757 | [5529 |[^u~| | 99 | [196 |[TTTe -lb | 

Description 

pir: [LN:A69985] [AC:A69985 ] [PN : hypothetical protein yshA] [GN: yshA ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell84110 : g2635326] [LN:BSUB0015] [AC : Z99118 : AL009126] 
[GN: yshA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis 
complete genome (section 15 of 21): from 2795131to 3013540.] [LE:129612] [RE:129869] 
[DI: complement] >gp : [GI : ell65327 : gl 7 7 003 3] [LN : BSZ75208] [AC:Z75208] [PN : hypothetical 
protein] [GN:yshA] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis genomic 
sequence 89009bp.] [NT:unknown function; putative] [LE:40038] [RE:40295] [DI:direct] 

NT AA 

ORF Name NTID AAI D — , — , score probability 
LengthLength ^ 



AI750300u990^23442177^t3_gQ0 | [1758 [ [ 5530 | |669 | [ 222 [ p^~|| 2 . 5e-40 



Description 

pir: [LN:D69864] [AC:D69864 ] [PN : hypothetical protein yktB] [GN:yktB ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell85055:g2 633836] [LN : BSUB0008] [AC : Z99111 : AL009126] 
[GN:yktB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis 
complete genome (section 8 of 21) : from 1394791to 1603020.] [LE: 140850] [RE: 141488] 
[DI: complement] >gp : [GI :g3282149] [LN : AF012285] [AC : AF012285 : AF012284 :U51911] 
[PN:unknown] [GN:yktB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 2] 
[DE: Bacillus subtilis mobA-nprE gene region.] [LE: 41338] [RE: 41976] [DI : complement] 



421 



NT AA 

ORFName NTID AAID ^ thLe — ^ score probability 

|AI7503000990_2344aU38_tl_217 | [1 7 59 | [ 5531 | |8 7 3 [[ 290 | . Ue-VU — | 

Description 

sp: [LN:FOLD_BACSU] [AC:P54382] [GN : FOLD] [OR : BACILLUS SUBTILIS] [EC : 1 . 5 . 1 . 5 : 3 . 5 . 4 . 9] 
[DE : METHENYLTETRAHYDROFOLATE CYCLOHYDROLASE , ] [SP : P54382] [DB : swissprot] 
>pir: [LN:E69626] [AC:E69626 ] [PN : methylene tetrahydrof olat e dehydrogenase (NADP+) , / 
methenyltetrahydrof olate cyclohydrolase, ] [GN: f olD ] [CL:methylenetetrahydrof olate 
dehydrogenase (NAD+) : methyl enetetrahydrof olate dehydrogenase (NAD+) homology] 
[OR:Bacillus subtilis] [EC :1. 5. 1.5:3. 5. 4. 9] [DB:pir2] >gp : [GI :dl013251 :gl303916] 
[LN:BACJH642] [AC :D84432 : D82370] [PNrYqiA] [ORiBacillus subtilis] [SR:Bacillus subtilis 
(strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis DNA, 283 Kb 
region containing skin element.] [LE: 190351] [RE: 191202] [DI: direct] 
>gp: [GI:ell85699:g2634865] [LN : BSUB0013] [AC : Z99116 : AL009126] 

[PN: methenyltetrahydrof olate cyclohydrolase] [GN : f olD] [FN:purines and amino acids 
biosynthesis] [OR: Bacillus subtilis] [DB : genpept-bctl] [EC: 1 . 5 . 1 . 5 : 3 . 5 . 4 . 9] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT alternate gene name: yqiA;] [SP:P54382] [LE: 132419] [RE: 133270] [DI : complement] 

NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 

|&I7S03000990_234B3767J:i_fi2 | [1760 | [5532 ||T5B~~| | 45 | 

Description 
NO-HIT 

NT AA 

ORF Name NTID AAID — , — , , score probabilit y 
LengthLength 

[AI7503000990_23492327_tl_37 | [1761 | | 5b33 ||Zra~| | 67 | p& | |3.3e-0B — [ 

Description 

pir: [LN:S75993] [AC:S75993 ] [PN : hypothetical protein] [OR : Synechocystis sp.] [SR:PCC 
6803, , PCC 6803] [SR:PCC6803, ] [DB:pir2] >gp : [GI :dl011491 :gl001353] [LN : SYCSLLLH] 
[AC:D64006 :AB001339] [PN : hypothetical protein] [GN:clpP] [OR : Synechocystis sp.] 
[SR: Synechocystis sp. (strain : PCC6803 ) DNA] [DB : genpept-bctl] [DE : Synechocystis sp. 
PCC6803 complete genome, 25/27, 3138604-3270709.] [NT :ORF__ID : sll0498] [LE:80076] 
[RE : 80528] [DI : complement] 

NT AA 

ORF Name NTID AAID — , — , score probability 

LengthLength c 



AI7S03000990_23B32329_ca_8a9 1 [1752 | [5534 |pTTT] |370 | pT2"]| i . 3e-146 



Description 

Sp : [LN : ODPA_BACST] [AC:P21873] [GN : PDHA] [OR:BACILLUS STEAROTHERMOPHILUS] [EC:1.2.4.1] 
[DE: PYRUVATE DEHYDROGENASE El COMPONENT, ALPHA SUBUNIT, ] [SP:P21873] [DB : swissprot] 

NT AA 

ORF Name NTID AAID — , — , , score probabili ty 



I AI7S03000990 23550313 E2 329 



Description 
NO-HIT 



LengthLength - 
[1763 | [5535 ||14i ||45 | 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

[AI7503000930_23572178_c2_8gl | [1754 | [5535 |28i | [1070 |[3 . le-108 



Description 

sp: [LN:DHSB__BACSU] [AC:P08066] [GN: SDHB] [OR:BACILLUS SUBTILIS] [EC : 1 . 3 . 99 . 1] 
[DE:SUCCINATE DEHYDROGENASE IRON-SULFUR PROTEIN,] [SP:P08066] [DB : swissprot] 



422 



NT AA 

ORF Name NTID AAID Lg^thLgngth score probability 



Al7b03000990_23b722S3_clJ796 | [1 7 65 | [ 5537 || 741 ~| [245 || 623 || 7.ie-6i 



Description 

pir: [LN:B69693] [AC : B69693 : JC4821 ] [PN : ribonuclease III,:RNase D : RNase 0] [GN:rncS:srb 
] [CL: ribonuclease III : double- stranded RNA-binding repeat homology] [OR:Bacillus 
subtilis] [EC:3.1.26.3] [DB:pir2] >gp: [GI : ell85184 :g2633965] [LN: BSUB0009] 
[AC:Z99112 :AL009126] [PN : ribonuclease III] [GNirncS] [ORrBacillus subtilis] 
[DB:genpept-bctl] [EC : 3 . 1 . 26 . 3] [DErBacillus subtilis complete genome (section 9 of 
21): from 1598421to 1807200.] [NT : alternate gene name: rnc] [LE:66689] [RE:67438] 
[DI: direct] 



NT AA 

ORF Name NTID AAID — , — , , score proba bility 
Length Length c ^ 



■17503000590_236325g2_c2_fla0 | [1765 | [5538 |[gTu~~ | | 269 | |2TU | |1 . 8e- T? 



Description 

sp: [LN:YFIE_BACSU] [AC:P54721] [GN : YFIE] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 31 . 5 
KD PROTEIN IN GLVBC 3 1 REGION] [SP:P54721] [DB : swissprot] >pir : [LN : H69802] [AC:H69802 ] 
[PN:conserved hypothetical protein yfiE] [GN:yfiE ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI :ell82814 :g2633148] [LN : BSUB0005] [AC : Z99108 : AL009126] [GN:yfiE] [FN:unknovm] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
5 of 21): from 802821 tol011250.] [NT:similar to hypothetical proteins from B. 
subtilis] [SP:P54721] [LE:94696] [RE:95553] [DI:direct] >gp : [GI : dl009744 : gl486247] 
[LN:D50543] [AC:D50543] [PN:unknown] [GN:yfiE] [FN:unknown] [OR:Bacillus subtilis] 
[SR: Bacillus subtilis (strain: 168, haplotype : haploid) DNA] [DB : genpept-bctl] 
[DE:Bacillus subtilis DNA for 76-degree region, complete cds . ] [LE:8372] [RE: 9229] 
[DI : direct] 



NT AA 

ORF Name NTID AAID — , — , , score probability 
LengthLength 



piI7S03000990_236422i7_cl_790 | [1757 | | bb39 |p3~ ]| 215 | p55~| |8 . le- 37 

Description 

pir: [LN:C69879] [AC:C69879 ] [PN : hypothetical protein yloS] [GN:yloS ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell85171:g2 633952] [LN:BSUB0009] [AC : Z99112 : AL009126] 
[GN:yloS] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [LE:55709] [RE:56353] 
[DI: direct] >gp : [GI : e323508 :g2337809] [LN:BSY13937] [AC:Y13937] [PN:YloS protein] 
[GN:yloS] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
genomic DNA from the spoVM region.] [LE: 17481] [RE: 18125] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — , . score probability 
LengthLength *- 



IAI7503000990 23647178 c2 903 



1758 | |bb40 |pi~||210 |p^~[|5.0e-T7 



Description 

pir: [LN:A69880] [AC:A69880 ] [PN : hypothetical protein ylpC] [GN:ylpC ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI:ell85179:g2 633960] [LN : BSUB0009] [AC : Z99112 : AL009126] 
[GN:ylpC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 9 of 21) : from 1598421to 1807200.] [LE: 62946] [RE: 63512] 
[DI: direct] >gp : [GI : e323513 :g2337817] [LN : BSY13937] [AC:Y13937] [PN:YlpC protein] 
[GN:ylpC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
genomic DNA from the spoVM region.] [LE: 24718] [RE: 25284] [DI: direct] 



423 



NT AA 

ORFName NTID AAI2 Le ^thLe^th ^^ probability 



|A17b03000990J^b0293_cl_78b | [ 1759 | | S54i || 527 | pTT5 | p3~ || 2 . 7e-68 | 

Description 

pir: [LN:B69878] [AC:B69878 ] [PN : guanylate kinase homolog yloD] [GN:yloD ] 
[CLrguanylate kinase : guanylate kinase homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85159:g2633940] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:yloD] [FN:unknown] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to guanylate kinase] [LE:42808] 
[RE:43542] [DI:direct] >gp : [GI : e323500 :g2337797] [LN : BSY13937] [AC:Y13937] [PN:putative 
Gmk protein] [GN:yloD] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis genomic DNA from the spoVM region.] [LE:4580] [RE:5314] 
[DI :direct] 

NT AA 

ORF Name NTID AAID — „ — . score probability 
LengthLength Lm 

AI7g03000990_23695900_c2_812 | [1770 | [ 5542 ||1551 | | blfe | [ 1490 |pT5e -lb3 | 



Description 

gp: [GI:g4097534] [LN : LLU64311] [AC:U64311] [PN : phosphor ibosylpyrophosphate 
amidotransf erase] [GN:purF] [OR : Lactococcus lactis] [DB : genpept-bct2] [EC : 2 . 4 . 2 . 14] 
[DE : Lactococcus lactis phosphoribosylaminoimidazolesuccinocarboxamidesynthetase (purC) , 
phosphor ibosylf ormylglycinamidine synthetase I (purQ) , phosphor ibosylf ormylglycinamidine 
synthetase II (purL) , andphosphor ibosylpyrophosphate amidotransf erase (purF) genes, 
completecds; and unknown gene.] [NT:PRPP ATase] [LE-.4921] [RE:6441] [DI: direct] 

NT AA 

ORF Name NTID AAID — _^ — , , score probabi lity 
LengthLength -L 

[A17b03000990_23730340_ci_716 | [1771 | [5543 |[T?2T]| 473 [ | 2270 j[2 . le-23b 1 

Description 

pir: [LN:S19723] [AC:S19723 ] [PN : dihydrolipoamide dehydrogenase, :pyruvate dehydrogenase 
complex chain E3] [GN :pdhD ] [CL : dihydrolipoamide dehydrogenase : dihydrolipoamide 
dehydrogenase homology] [OR : Staphylococcus aureus] [EC: 1.8. 1.4] [DB:pirl] 
>gp: [GI:g48874] [LN: SAPDHDNA] [AC : X58434 : S73625] [PN : dihydrolipoamide dehydrogenase: 
subunit E3] [GN:pdhD] [OR : Staphylococcus aureus] [DB : genpept -be tl ] [EC: 1.8. 1.4] 
[DE:S. aureus pdhB, pdhC and pdhD genes for pyruvate decarboxylase , dihydrolipoamide 
acetyltransf erase and dihydrolipoamidedehydrogenase . ] [LE:1853] [RE: 3259] [DI: direct] 



NT AA 

ORF Name NTID AAID — _ — , , score probab ility 
LengthLength -t - 



AI7503000990_23836036_c2_887 | [1772 | [ 5544 | p^~| | 63 | pi~ | |1 . 2e-09 



Description 

sp: [LN:CARB_BACCL] [AC:P46537] [GN : PYRAB] [OR:BACILLUS CALDOLYTICUS] [EC:6.3.5.5] 
[DE:(EC 6.3.5.5) (CARBAMOYL- PHOSPHATE SYNTHETASE AMMONIA CHAIN)] [SP:P46537] 
[DB:swissprot] >pir : [LN : 140169] [AC : 140169 : S34321 ] [PN : carbamoyl -phosphate synthase 
(glutamine-hydrolyzing) , ] [GN:pyrAb ] [CL : carbamoyl -phosphate synthase 
(glutamine-hydrolyzing) large chain :biot in carboxylase homology : carbamoyl -phosphate 
synthase (glutamine-hydrolyzing) large chain homology] [OR: Bacillus caldolyticus] [EC: 
6.3.5.5] [DB:pir2] >gp : [GI :g312443] [LN : BCPYR] [AC:X73308] [PN : carbamoyl -phosphate 
synthase] [GN:PyrAb] [OR:Bacillus caldolyticus] [DB :genpept-bctl] [EC: 6. 3. 5. 5] 
[DE:B. caldolyticus pyrimidine biosynthesis genes.] [SP:P46537] [LE:3658] [RE:6855] 
[DI : direct] 

NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength *- 

AI7503000990_2392fl537_c3_553 | [1773 | [5545 |pn~||46 | 

Description 
NO-HIT 



424 



ORF Name 



NT 



AA 



NTI5 MI5 Le - thLe - th 3£ore probability 



17503000990 24025^53 ti 215 



~] |1774 | [5546 ||32i | |1Q6 [ j!58 j|1.3e-14 



Description 

pir: [LN:T00323] [AC:T00323 ] [PN : chitinase, B] [OR : Clostridium paraputrif icum] 
[EC:3.2.1.14] [DB:pir3] >gp : [GI : dl024701 : g2696017] [LN : AB001874] [AC : AB001874 ] 
[PN:chitinase B] [GN : chiB] [OR : Clostridium paraputrif icum] [SR : Clostridium 
paraputrif icum (strain :M21) DNA] [DB :genpept-bctl] [EC: 3 . 2 . 1 . 14] [DE Clostridium 
paraputrif icum gene for chitinase B, complete cds . ] [LE:1] [RE: 2496] [DI: direct] 



ORF Name 



NT 



AA 



NT ID AAID Le ^ t fr Le ^ th score probability 



03000990 24025467 c2 863 



] [1775 | [5547 |J5S2~| [173 | pg~ || 4 . 5e-20 



Description 

sp : [LN : YSNB_BACSU] [AC:P94559] [GN : YSNB] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 19.2 

KD PROTEIN IN RPH-ILVB INTERGENIC REGION] [SP:P94559] [DB : swissprot] >pir : [LN : D69986 ] 
[AC:D69986 ] [PN : conserved hypothetical protein ysnB] [GN : ysnB ] [CL : conserved 

hypothetical protein MG207: phosphoesterase core homology] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI :ell84084 :g2 635300] [LN: BSUB0015] [AC : Z99118 : AL009126] [GN : ysnB] 
[FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 15 of 21) : from 2795131to 3013540.] [NT: similar to hypothetical 

proteins] [SP:P94559] [LE: 103990] [RE: 104505] [DI : complement ] 

>gp: [GI:ell65358:gl770061] [LN : BS275208] [AC:Z75208] [PN : hypothetical protein] 
[GN:ysnB] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis genomic sequence 
89009bp.] [NT:homology to HI0260 of Haemophilus influenzae;] [SP:P94559] [LE:65402] 
[RE:65917] [DI :direct] 



ORF Name 



NT ID 



AAID 



NT 



LengthLength 



AA 

— , score probability 



IAI7503000990 24220002 ti 67 



] [1776 | [5548 ||129"]|42 | 



Description 
NO-HIT 



ORF Name 



NT AA 
NT ID AAID — _ T — , , score 
LengthLength 



probability 



LAI7503000990 24240676 t3 477 



"| [1777 | [5549 |p46~||81 | 



Description 
NO-HIT 

ORF Name 



NT ID AAID 



NT 



AA 



LengthLength 



score 



IAI7503000990 24245437 cl 763 



TT7W 



[5550 IpUTTTj [5T5~ 



TT7T 



probability 
9.8e-i5i 



Description 

gp: [GI:dl023423 :g2463562] [LN: AB007500] [AC : ABO 075 00 :D28879] [PN:MRAY] [GN:mraY] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus (strain :NCTC8325) DNA] 
[DB :genpept-bctl] [DE : Staphylococcus aureus genes for penicillin-binding protein 1, 

MraY,MurD, partial and complete cds.] [LE:3120] [RE:4085] [DI:direct] >gp : [GI : g4104230] 
[LN:AF034153] [AC : AF034153] [PN : phospho-N-acetylmuramoyl -pentapeptide] [GN:mraY] 
[OR : Staphylococcus aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus 

phospho-N-acetylmuramoyl-pentapeptidetranslocase (mraY) gene, complete cds.] [LE:28] 
[RE: 993] [DI:direct] 



ORF Name 



NT ID 



NT AA 

AAID — _^ — , . score probabili ty 
LengthLength c 



AI7503000990^24256250_r3_467 

Description 

NO-HIT 



TTTT 



425 



ORF Name 



NT 



AA 



NTID AAID — _ — _ 
LengthLength 



score 



1780 I |5552 |pU7T"[ pb6 



probability 
3.0e-94 



[AI7503000990_242b6b51_c3J)23 

Description 

sp: [LN:PUR5_BACSU] [AC:P12043] [GN : PURM : ATH] [0R:BACILLUS SUBTILIS] [EC:6.3.3.1] 
[DE : (PHOSPHORIBOSYL-AMINOIMIDAZOLE SYNTHETASE) (AIR SYNTHASE)] [SP:P12043] 
[DBiswissprot] >pir : [LN : AJBSCL] [AC : H29326 : A69685 ] 

[PN: phosphor ibosylformylglycinamidine cyclo-ligase , iphosphoribosylaminoimidazole 
synthetase] [GN:purM ] [CL rphosphoribosylf ormylglycinamidine cyclo-ligase: 
phosphoribosylformylglycinamidine cyclo-ligase homology] [OR:Bacillus subtilis] 
[EC:6.3.3.1] [DB:pirl] [MP:18min] >gp : [GI : gl43371] [LN : BACPURF] [AC : J02732 :K00047] 
[0R:Bacillus subtilis] [SR : B . subtilis (strain DEI (prototroph DER. or W168) ) DNA # clone 
pPZ] [DB : genpept-bctl] [DE : B . subtilis pur operon encoding purine biosynthesis enzymes, 
12genes.] [NT : phosphor ibosyl aminoimidazole synthetase (PUR-M) ] [LE:8796] [RE:9836] 
[Dlidirect] >gp: [GI :ell82630:g2 632964] [LN : BSUB0004] [AC : Z99107 : AL009126] 
[PN iphosphoribosylaminoimidazole synthetase] [GNrpurM] [FNrpurine biosynthesis] 
[ORrBacillus subtilis] [DB : genpept-bctl] [EC:6.3.3.1] [DE:Bacillus subtilis complete 
genome (section 4 of 21): from 600701 to813890.] [SP:P12043] [LE:105778] [RE:106818] 
[DI:direct] 



ORF Name 



IAI7503000990 24257877 c2 854 



Description 
NO-HIT 



NT AA 
NTID AAID — _ — , , score 
LengthLength 



probability 



J [1781 | [5553 [p^T~| [42 | 



ORF Name 



NT 



AA 



NTID AAID — . T — _ 
LengthLength 



score 



U.7503000990 24261058 cl 758 



] [1782 | [5554 ||TT7 | |48 |pTF" 



probability 
1.2e-17 



Description 

gp: [GI:g3212079] [LN : AF068633 ] [AC : AF068633] [PN:phenol soluble modulin beta 1] 
[FN: inflammatory protein] [OR : Staphylococcus epidermidis] [DB : genpept-bct2] 
[DE: Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds.] [NT:PSM beta 1] [LE:669] [RE:803] [DI:direct] 

>gp: [GI:g3212080] [LN : AF068633 ] [AC : AF068633] [PN:phenol soluble modulin beta 2] 

[FN: inflammatory protein] [OR : Staphylococcus epidermidis] [DB : genpept-bct2] 

[DE: Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds.] [NT:PSM beta 2] [LE:859] [RE:993] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



1AI7503000990 24251068 c3 988 



[1783 | [5555 |pT7~[ p" 



[2nr 



probability 
1.2e-17 



Description 

gp: [GI:g3212079] [LN: AF068633] [AC : AF068633] [PN:phenol soluble modulin beta 1] 
[FN: inflammatory protein] [OR : Staphylococcus epidermidis] [DB :genpept-bct2] 
[DE: Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds.] [NT.-PSM beta 1] [LE:669] [RE:803] [DI: direct] 

>gp: [GI:g3212080] [LN : AF068633] [AC : AF068633] [PN:phenol soluble modulin beta 2] 

[FN: inflammatory protein] [OR : Staphylococcus epidermidis] [DB :genpept-bct2] 

[DE: Staphylococcus epidermidis phenol soluble modulin beta 1 and phenolsoluble modulin 

beta 2 genes, complete cds..] [NT:PSM beta 2] [LE:859] [RE:993] [DI:direct] 



ORF Name 



AI7503000990_24275137_rl_21 

Description 

NO-HIT 



NT AA 
NTID AAID — ^ — . , score 
LengthLength 

[1784 | |5b56 |pT-]|43 | 



probability 



426 



ORF Name 



NT ID AAID 



NT 



AA 



LenqthLength 



score probability 



F 



17503000990 24'2DT2TI cl 7B0 



][ 



|5557 ||3H4 | [1037 l |T755~|pTTr 



5.5] 



Description 

sp: [LN:CARB_BACCL] [AC:P46537] [GN : PYRAB] [OR:BACILLUS CALDOLYTICUS] [EC:6.3. 
[DE:(EC 6.3.5.5) (CARBAMOYL -PHOSPHATE SYNTHETASE AMMONIA CHAIN)] [SP:P46537] 
[DB:swissprot] >pir : [LN : 140169] [AC : 140169 : S34321 ] [PN: carbamoyl -phosphate synthase 
(glutamine-hydrolyzing),] [GNrpyrAb ] [CL : carbamoyl -phosphate synthase 
(glut amine -hydroly zing) large chain :biotin carboxylase homology : carbamoyl -phosphate 
synthase (glutamine-hydrolyzing) large chain homology] [OR: Bacillus caldolyticus] [EC: 
6.3.5.5] [DB:pir2] >gp : [GI :g312443] [LN: BCPYR] [AC:X73308] [PN : carbamoyl -phosphate 
synthase] [GN:PyrAb] [OR: Bacillus caldolyticus] [DB :genpept-bctl] [EC: 6. 3. 5. 5] 
[DE:B. caldolyticus pyrimidine biosynthesis genes.] [SP:P46537] [LE:3658] [RE: 6855] 
[DI: direct] 



ORF Name 



NT ID 



24330337 r3 557 



NT AA 

AAID 

LengthLength 



score probability 
555 ||i.le-53 



IE 



Description 

pir: [LN:H69984] [AC:H69984 ] [PN : conserved hypothetical protein ysgB] [GN:ysgB ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84111 :g2635327] [LN:BSUB0015] 
[AC:Z99118 :AL009126] [GN : ysgB] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[NT: similar to hypothetical proteins] [LE: 130003] [RE: 130944] [DI: direct] 
>gp: [GI:ell65326:gl770032] [LN:BS275208] [AC:Z75208] [PN : hypothetical protein] 
[GN:ysgB] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis genomic sequence 
89009bp.] [NT:unknown function; putative] [LE : 38963] [RE: 39904] [DI : complement] 



ORF Name 



NT ID AAID 



NT 



AA 



LengthLength 



score probability 



|AI750300099u_24351577_clj^77 



Description 
NO-HIT 



] [1787 | [5559 |pm~| p" 



ORF Name 



IAI7503000990 24355342 ci 759 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



[1788 | [5560 |puTT]|338 | [1265 ||6 . 6e- 129 



Description 

sp: [LN:SR54_BACSU] [AC:P37105] [GN : FFH] [OR: BACILLUS SUBTILIS] [DE: SIGNAL RECOGNITION 
PARTICLE PROTEIN ( FIFTY- FOUR HOMOLOG) ] [SP:P37105] [DB : swissprot ] >pir: [LN:B47154] 
[AC:B47154 :H69621 ] [PN:signal recognition particle chain ffh] [GN:ffh ] [CL:signal 
recognition particle 54K protein] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell85189:g2633970] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN:signal recognition 
particle] [GN:ffh] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [LE:73153] [RE:74493] 
[DI:direct] >gp : [GI : dl022545 :g2309080] [LN:D14356] [AC:D14356] [PN:Ffh] [GN:ffh] 
[OR: Bacillus subtilis] [SR: Bacillus subtilis (strain: 168) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis orfl, ffh, rpsP genes for ORF1, Ffh and 30Sribosomal protein S16, 
complete cds . ] [LE:711] [RE: 2051] [DI:direct] 



427 



ORF Name 



NTID 



NT AA 
AAID — , — , , score 
LengthLength 



F 



17503000990 24406291 c2 902 



T7W 



[ 5561 || 41V ~ || 13a 



probability 
b.9e-34 



Description 

pir: [LN:D69879] [AC:D69879 ] [PN : alkaline- shock protein homolog yloU] [GN:yloU ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e!185174 :g2633955] [LN: BSUB0009] 
[AC:Z99112 :AL009126] [GNiyloU] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT: similar to alkaline- shock protein] [LE: 57043] [RE: 57405] [DI:direct] 

>gp: [GI:e323527:g2337812] [LN: BSY13 937] [AC:Y13937] [PN:putative Asp23 protein] 
[GN:yloU] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 

genomic DNA from the spoVM region.] [LE: 18815] [RE: 19177] [DI : direct] 



ORF Name 



IAI7503000990 24406563 cl 742 



Description 
NO-HIT 



NTID 



T7W 



NT 



AA 



^ LengthLength probability 
|5562 |P^~| [46 | 
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L£7b03000990 24407313 r3 590 



Description 
NO-HIT 



NTID AAID 
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NT AA 
LengthLength 
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score probability 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



IAI7503000990 24407327 c3 1017 



[5564 ||gl2~||203 |pr 



probability 



Description 

Sp : [LN : PYRE_BACSU] [AC:P25972] [GN : PYRE : PYRX] [OR:BACILLUS SUBTILIS] [EC : 2 . 4 . 2 . 10] 
[DE : OROTATE PHOSPHORIBOSYLTRANSFERASE , (OPRT) (OPRTASE) ] [SP:P25972] [DB : swissprot] 
>pir: [LN:F69686] [AC : F69686 : A30492 ] [PN : orotate phosphoribosyl transferase , ] [GN : pyrE ] 
[CL:orotate phosphoribosyltransf erase : orotate phosphoribosyl transferase homology] 
[OR:Bacillus subtilis] [EC : 2 . 4 . 2 . 10] [DB:pirl] >gp : [GI : ell85148 : g2633929] [LN:BSUB0009] 
[AC : Z99112 : AL009126 ] [PN : orotate phosphoribosyltransf erase] [GN : pyrE] [FN : pyrimidine 
biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 .4 . 2 . 10] [DE:Bacillus 
subtilis complete genome (section 9 of 21): from 1598421to 1807200.] [NT : alternate gene 
name: pyrX] [SP:P25972] [LE:30299] [RE:30949] [DI:direct] >gp : [GI :gl43394] 
[LN : BACPYROP] [AC : M59757] [PN:OMP-PRPP transferase] [GN : pyrE] [OR:Bacillus subtilis] 
[DB : genpept-bct2] [DE:Bacillus subtilis ORF1A (pyrR) , putative membrane -bound 
uracilpermease (pyrP) , aspartate transcarbamylase (pyrB) , dihydroorotase (pyrC) , 
glutaminase of carbamyl phosphate synthetase (pyrAA) , carbamyl phosphate synthetase 
(pyrAB) , dihydroorotase dehydrogenase (pyrD) , OMP decarboxylase (pyrF) , and OMP-PRPP 
transferase (pyrE) genes, complete cds; and unknown gene.] [LE: 11703] [RE: 12353] 
[DI :direct] 



ORF Name 



NT AA 
NTID AAID — ^ T — ^ score 
LengthLength 



probability 



AI7503000990_24407760_cl_773 

Description 
NO-HIT 



] [1793 | [5565 ||123 | [40 | 
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ORF Name 



NT 



AA 



NTID AAID _ , « _ 

■ LengthLength 



score 



AI7503000990 24407936 c2 896 



] [1794 | [5566 |[TT22-||373 |pW] [ 



probability 
|4.3e-141 



Description 

sp: [LN:YLON_BACSU] [AC:034617] [GN:YLON] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 41.6 
KD PROTEIN IN FMT-SPOVM INTERGENIC REGION] [SP:034617] [DB : swissprot] >pir : [LN : F698 78] 
[AC:F69878 ] [PN : conserved hypothetical protein yloN] [GN:yloN ] [CL : conserved 
hypothetical protein HI0365] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell85166:g2633947] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:yloN] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to hypothetical proteins] [SP:034617] 
[LE:50265] [RE:51356] [DIrdirect] >gp: [GI : e323524 : g2337804] [LN : BSY13937] [AC:Y13937] 
[PN:YloN protein] [GN : yloN] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis genomic DNA from the spoVM region.] [SP:034617] [LE:12037] 
[RE:13128] [DIrdirect] 



ORF Name 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



IAI7503000990 24412812 c3 986 



Description 
NO-HIT 



ORF Name 



NTID 



^ n NT AA 

AAID — — score 

LengthLength 



probability 



IAI7503000990 24415885 c3 960 



[5558 |pTT||408 [[859 ||6.1e-87 



Description 

gp: [GI:g4096797] [LN: SCU40157] [AC:U40157] [OR : Staphylococcus carnosus] 
[DB:genpept-bct2] [DE : Staphylococcus carnosus condensing -enzyme -like protein (orfl) 
andSpoVE-like protein (orf2) genes, complete cds . ] [NT:orf2; unknown function; similar 
to SpoVE, RodA, ] [LE:1676] [RE:2779] [DI:direct] 



ORF Name 



NTID 



_ _ _ NT AA 

AAID — — .score 

LengthLength 



[A17503000990 24422077 c3 1013 



~| [1797 | [5569 ||li0!q |366 |[TT5F 



probability 
li.4e-121 



Description 

sp : [LN : CARA_BACCL] [AC:P52557] [GN : PYRAA] [OR:BACILLUS CALDOLYTICUS] [EC:6.3.5.5] 
[DE: (EC 6.3.5.5) (CARBAMOYL- PHOSPHATE SYNTHETASE GLUT AMINE CHAIN)] [SP:P52557] 
[DB: swissprot] >pir : [LN: 140168] [AC : 140168 : S34320 ] [PN : carbamoyl -phosphate synthase 
(glutamine-hydrolyzing) , ] [GN:pyrAa ] [CL : carbamoyl -phosphate synthase 
(glutamine-hydrolyzing) small chain : carbamoyl -phosphate synthase 

(glutamine-hydrolyzing) small chain homology : trpG homology] [OR:Bacillus caldolyticus] 
[EC: 6.3.5.5] [DB:pir2] >gp : [GI :g312442] [LN: BCPYR] [AC:X73308] [PN: carbamoyl -phosphate 
synthase] [GN:PyrAa] [OR:Bacillus caldolyticus] [DB : genpept-bctl] [EC:6.3.5.5] 
[DE:B. caldolyticus pyrimidine biosynthesis genes.] [SP:P52557] [LE:2571] [RE: 3665] 
[DI : direct] 



ORF Name 



NT AA 
NTID AAID — — , , score 
LengthLength 



probability 



IAI7503000990 24424038 cl 791 



] [1798 | |SS70 |JIg57"||b63 | p?F]|7 . 8e-158 



Description 

pir: [LN:E69879] [AC:E69879 ] [PN : conserved hypothetical protein yloV] [GN:yloV ] 
[CL Mycoplasma genitalium hypothetical protein MG369] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85175:g2633956] [LN: BSUB0009] [AC : Z99112 : AL009126] [GN:yloV] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT: similar to hypothetical proteins] [LE: 57421] 
[RE:59082] [DI:direct] >gp : [GI : e323510 :g2337813] [LN: BSY13937] [AC:Y13937] [PN:YloV 
protein] [GN:yloV] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis genomic DNA from the spoVM region.] [LE: 19193] [RE: 20854] [DI: direct] 



429 
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NT 



AA 



AI7503000990_2447b252J:2_37i 
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Description 
NO-HIT 

ORF Name 
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LengthLength 



score 



^7503000990 24484828 c2 832 
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probability 
8.2e-28 



Description 

gp: [GI :g4981179] [LN : AE001739] [AC : AE001739 : AE000512] [PN : conserved hypothetical 
protein] [GN:TM0656] [OR : Thermo toga maritima] [DB : genpept -bct2 ] [DE : Thermo toga maritima 
section 51 of 136 of the complete genome.] [NT: similar to SP:P38522 GB:U00096 
PID:1742120] [LE:1379] [RE:1909] [D I : complement] 



ORF Name 



NT ID AAID _ — . , . . 



IAI7503000990 2448S9iiO c3 951 



][ 
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LengthLength 
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probability 
0 . 015 



Description 

gp: [GI:g5306139] [LN : AF160864 ] [AC : AF160864] [PN: NADH dehydrogenase subunit 2] 
[GN:nad2] [OR : Mitochondrion Tetrahymena pyriformis] [SR : Tetrahymena pyriformis] 
[DB: genpept] [EC: 1.6. 5. 3] [DE : Tetrahymena pyriformis mitochondrial DNA, complete 

genome.] [LE: 10108] [RE: 10644] [DI : complement] 



ORF Name 



NT 



AA 



NT ID AAID _ _ 

LengthLength 



score 



IAI7503000990 24609637 c3 1001 
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probability 
6 . 4e-92 



Description 

gp: [GI:g4009492] [LN : AF068904] [AC : AF068904] [PN : YlmD] [GN : ylmD] [OR : Staphylococcus 
aureus] [DB : genpept -bet 2] [DE : Staphylococcus aureus cell division protein FtsZ (ftsZ) 
gene,partial cds; YlmD (ylmD), YlmE (ylmE) , YlmF (ylmF) , YlmG (ylmG) , and YlmH (ylmH) 
genes, complete cds; and cell division proteinDivIVA (divIVA) gene, partial cds.] 
[NT: similar to Bacillus subtilis YlmD] [LE:437] [RE: 1228] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 
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Description 
NO-HIT 



1803 | [5575 ||423 ||140 | 



ORF Name 



NTID 



NT 



AA 



AAID — _ — score probability 
LengthLength s - 



IAI7503000990 24632827 cl 786 



] [1804 | [5576 |pFl~||320 ||917 ||5.0e-95 



Description 

pir : [LN:A69626] [AC:A69626 ] [PN :methionyl- tRNA formyltransf erase , fmt] [GN:fmt ] 
[CL:methionyl-tRNA formyltransf erase : phosphoribosylglycinamide formyltransf erase 

homology] [OR:Bacillus subtilis] [EC:2.1.2.9] [DB:pir2] >gp : [GI : ell85164 :g2633945] 
[LN:BSUB0009] [AC : Z99112 : AL009126] [PN :methionyl-tRNA formyltransf erase] [GN:fmt] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [EC: 2. 1.2. 9] [DE:Bacillus subtilis complete 

genome (section 9 of 21): from 1598421to 1807200.] [NT : alternate gene name: yloL] 
[LE:47978] [RE.-48931] [DI:direct] >gp : [GI :e323503:g2 337802] [LN: BSY13937] [AC:Y13937] 
[PN: putative Fmt protein] [GN : yloL] [FN: unknown] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [DE: Bacillus subtilis genomic DNA from the spoVM region.] [LE:9750] 
[RE: 10703] [DI :direct] 



430 



NT AA 

ORFName NTID AAID ^^score probability 



|Al7503000990^4fe4204^c2_t}ba | [ 1505 | [ 5577 [pi - ]pS |p bb | |7 .ie-22 | 

Description 

pir: [LN:C69875] [AC:C69875 ] [PN : hypothetical protein ylbN] [GN : ylbN ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI : e334782 :g2340010] [LN : BS16823KB] [AC:Z98682] [PN : YlbN 
protein] [GN:ylbN] [OR:Bacillus subtilis] [DB : genpept - be t 1 ] [DErBacillus subtilis 
genomic DNA 23.9kB fragment.] [LE: 20393] [RE: 20911] [DI: direct] 

>gp: [GI:ell85097:g2633878] [LN : BSUB0008] [AC : Z99111 : AL009126] [GN:ylbN] [FN:unknown] 
[ORrBacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [LE:179876] [RE:180394] [DI: direct] 

NT AA 

ORF Name NTID AAID — — _ score probabi lity 
LengthLength 



AI7503000990 24642817 C2 862 



180$ | [5578 ||810 | | pT5T]|i . le-121 



Description 

Sp : [LN:MURI_STAHA] [AC:P52974] [GN:DGA] [OR : STAPHYLOCOCCUS HAEMOL YT I CUS ] [EC:5. 1.1.3] 
[DE : GLUTAMATE RACEMASE,] [SP:P52974] [DB : swissprot] >gp : [GI : g52 0574] [LN : SHU124 05] 
[AC:U12405] [ PN : glut ama te racemase] [GN:dga] [OR : Staphylococcus haemolyticus] 
[DB :genpept-bctl] [DE : Staphylococcus haemolyticus Y176 glutamate racemase (dga) 
gene, complete cds . ] [LE:263] [RE:1063] [DI:direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|Ai7b03000990_24643836_c3_943 | [1807 | [5579 || 533 "^| | 210 | p27 | |6.6e-19 

Description 

pir: [LN:D69870] [AC :D69870 : A3 6 7 18 ] [PN : conserved hypothetical protein 

ykyA: hypothetical protein (aceA 5' region)] [GN:ykyA ] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85047:g2633828] [LN : BSUB0008] [AC : Z 99111 : AL009126] [GN:ykyA] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21) : from 1394791to 1603020.] [NT : alternate gene name: ykrC; similar to 
hypothetical] [LE:131900] [RE:132517] [DI:direct] >gp : [GI :g3282141] [LN : AF012285] 
[AC:AF012285:AF012284:U51911] [PN:unknown] [GN:ykrC] [FN:unknown] [OR:Bacillus 
subtilis] [DB: genpept -bet 2] [DE: Bacillus subtilis mobA-nprE gene region.] [NT: similar 
to Orf5 encoded by GenBank Accession] [LE: 32388] [RE: 33005] [DI:direct] 

NT AA 

ORF Name NTID AAID — . T — , , score probabilit y 
LengthLength c L 



AI7B03000950_2464S02S_c3_1012 | [1808 | [5580 |p^T]| 427 |pFT|| J . le-131 "] 



Description 

sp : [LN : PYRC_BACCL] [AC:P46538] [GN : PYRC] [OR: BACILLUS CALDOLYTICUS] [EC: 3. 5. 2. 3] 
[DE : DIHYDROOROTASE , (DHOASE) ] [SP:P46538] [DB : swissprot] >pir: [LN:I40167] 
[AC:I40167 :S34319 ] [PN : dihydroorotase , ] [GN:pyrC ] [CL:Bacillus 

dihydroorotase : Bacillus dihydroorotase homology] [OR: Bacillus caldolyticus] 
[EC:3.5.2.3] [DB:pir2] >gp : [GI :g312441] [LN : BCPYR] [AC:X73308] [PN : dihydroorotase] 
[GN:PyrC] [OR: Bacillus caldolyticus] [DB : genpept -bctl] [EC: 3. 5. 2. 3] [DE : B . caldolyticus 

pyrimidine biosynthesis genes.] [SP:P46538] [LE:1285] [RE: 2568] [DI: direct] 
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NTID 
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LengthLength 



score 



U.7b03000990 2464^412 cl 798 
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probability 
7.4e-27 



Description 

sp : [LN : YLXM_BACSU] [AC:P37104] [GN : YLXM] [OR : BACILLUS SUBTILIS] [DE : HYPOTHETICAL 13.2 
KD PROTEIN IN FFH 5 1 REGION] [SP:P37104] [DB : swissprot ] >pir: [LN:A47154] 

[AC:A47154 :A69882 ] [PN : conserved hypothetical protein ylxM] [GN : ylxM ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell85188 : g2633969] [LN : BSUB0009] [AC : Z99112 : AL009126] 

[GN:ylxM] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept - bet 1 ] [DE: Bacillus subtilis 
complete genome (section 9 of 21): from 1598421to 1807200.] [NT:similar to hypothetical 
proteins] [LE:72807] [RE:73139] [DI: direct] >gp : [GI : dl023083 :g2424968] [LN:D14356] 

[AC:D14356] [PN:0RF1] [GN:orfl] [OR:Bacillus subtilis] [SR:Bacillus subtilis 

(strain: 168) DNA] [DB : genpept -bet 1] [DE:Bacillus subtilis orfl, ffh, rpsP genes for 
ORF1, Ffh and 30Sribosomal protein S16 , complete cds . ] [LE:365] [RE: 697] [DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000990 24552178 c2 847 
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probability 
|2.7e-13 



Description 

pir: [LN:F69930] [AC:F69930 ] [PN : conserved hypothetical protein yozB] [GN : yozB ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85386 :g2634307] [LN : BSUB0011] 
[AC:Z99114 :AL009126] [GN : yozB] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 11 of 21): from 2000171to 2207900.] 
[NT: similar to hypothetical proteins] [LE: 85155] [RE: 85691] [DI : complement] 



ORF Name 



NTID 



AAID 



U.7S03000990 24730340 c3 937 



1811 



NT AA 
LengthLength 
T77 



TIT 



score probability 
731 ||2.5e-72 



Description 

Sp : [LN : YKQB_BACSU] [AC:P39760] [GN: YKQB] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 24.3 
KD PROTEIN IN KINC-ADEC INTERGENIC REGION (ORF4)] [SP:P39760] [DB : swissprot] 
>pir : [LN:A69862] [AC : A69862 : PC6016 ] [PN : conserved hypothetical protein ykqB] [GN:ykqB 
] [CL:conserved hypothetical protein MG323] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:dl007628:g520844] [LN : BACAMOKOOO] [AC:D37799] [PN:orf4] [OR:Bacillus subtilis] 
[SR: Bacillus subtilis (sub_species : Marburg, strain: 168] [DB : genpept-bctl] [DE: Bacillus 
subtilis genes for ampS, mreBH, orfl, kinC, orf3, orf4 andorf 5 . ] [LE:5175] [RE:5840] 
[DI: direct] >gp: [GI :ell85 04 l:g2 633822] [LN : BSUB0008] [AC : Z99111 : AL009126] [GN:ykqB] 
[FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete 
genome (section 8 of 21): from 1394791to 1603020.] [NT : alternate gene name: ylxV, yzaC; 
similar to] [SP:P39760] [LE:125146] [RE:125811] [DI:direct] >gp : [GI : g3282136 ] 
[LN:AF012285] [AC : AF012285 : AF012284 :U51911] [PN:unknown] [GN:ykqA] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept -bct2] [DE: Bacillus subtilis mobA-nprE gene region.] 
[NT: similar to product of orf4 encoded by GenBank] [LE: 25634] [RE: 26299] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — _ — , , score 
LengthLength 



probability 



|A17b03000990 24U01713 c3 946 



T&TT 



5584 111314 I WT7 



1906 ||7.9e-197 



Description 

sp: [LN:0DP2_STAAU] [AC:Q59821] [GN : PDHC] [OR : STAPHYLOCOCCUS AUREUS] [EC : 2 . 3 . 1 . 12 ] 
[DECOMPLEX, (E2)] [SP:Q59821] [DB : swissprot] >pir : [LN: S19722] [AC:S19722 ] 
[PN:dihydrolipoamide S-acetyltransf erase, chain E2] [CL : dihydrolipoamide 

acetyltransf erase : lipoyl/biotin-binding homology] [OR : Staphylococcus aureus] 
[EC:2.3.1.12] [DB:pir2] >gp : [GI:g581570] [ LN : S APDHDNA] [AC : X58434 : S73625] 
[PN: dihydrolipoamide acetyltransf erase : subunit E2] [GN:pdhC] [OR : Staphylococcus 

aureus] [DB : genpept-bctl] [EC : 2 . 3 . 1 . 12] [DE:S. aureus pdhB, pdhC and pdhD genes for 

pyruvate decarboxylase , dihydrolipoamide acetyltransf erase and 

dihydrolipoamidedehydrogenase . ] [SP :Q59821] [LE : 557] [RE : 1849] [DI : direct] 
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NT AA 

ORF_Name NTID AAID Le ~ thLe ~ th score probability 

|A17B03000990J*4U0fcbB7_c2J>48 | [ 1813 | [ 5585 | | i056 1 1351 |pgg— || J . 9e- JV -| 

Description 

pir: [LN:H69873] [AC:H69873 ] [PN : conserved hypothetical protein ylbC] [GNiylbC ] 
[ORiBacillus subtilis] [DB:pir2] >gp : [GI : e334771 :g2339999] [LN : BS 16 82 3KB] [AC:Z98682] 
[PN:YlbC protein] [GN:ylbC] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 

subtilis genomic DNA 23.9kB fragment.] [LE:11510] [RE:12550] [Dlidirect] 

>gp: [GI:ell85086:g2633867] [LN:BSUB0008] [AC : Z99111 : AL009126] [GNiylbC] [FN:unknown] 
[ORiBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 

8 of 21): from 1394791to 1603020.] [NT:similar to hypothetical proteins from B. 

subtilis] [LE:170993] [RE:172033] [DIzdirect] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

|AI7503000990_24flgi577_c3_97i | [1814 | p^~|| 564 1 1 187 || 405 || 9.0e-J8 | 

Description 

pir: [LN:E69874] [AC:E69874 ] [PN : conserved hypothetical protein ylbH] [GN:ylbH ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e334776 :g2340004] [LN : BS 16 82 3 KB] [AC:Z98682] 
[PN: YlbH protein] [GN: ylbH] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis genomic DNA 23.9kB fragment.] [LE:14650] [RE:15144] [DI:direct] 
>gp: [GI:ell85091:g2633872] [LN : BSUB0008] [AC : Z99111 : AL009126] [GN : ylbH] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [NT:similar to hypothetical proteins] [LE:174133] 
[RE: 174627] [DI idirect] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|ftI7B03000990_2S425202_cl_749 | [1815 | pF7~ |pT"] | 1V3 | p^~[|i . ie-ib 

Description 

pir: [LN:B69985] [AC:B69985 ] [PN : hypothetical protein yshB] [GN:yshB ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : el 184109 :g2 635325] [LN : BSUB0015] [AC : Z99118 : AL009126] 
[GN:yshB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 15 of 21): from 2795131to 3013540.] [LE:129072] [RE:129605] 
[DI: complement] >gp: [GI : ell65328 :gl770034] [LN : BSZ75208] [AC:Z75208] [PN : hypothetical 
protein] [GN : yshB] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE :B . subtilis genomic 
sequence 89009bp.] [NT:unknown function; putative] [LE:40302] [RE:40835] [DI:direct] 



NT AA 

ORF Name NTID AAID — — , , score probabilit y 
L en gthLength c J ~ 



|&I7503000990_25475250_c3_9fl5 | [1816 | [5588 |pTT| p8 | p^uT]| 4 . 3e-260 — | 



Description 

pir: [LN:A27763] [AC : A2 7763 : C69704 ] [PN : succinate dehydrogenase, f lavoprotein : fumarate 
reductase : fumaric hydrogenase : succinic dehydrogenase] [GN: sdhA : sdhB ] [CL: fumarate 
reductase f lavoprotein: 3 -oxosteroid 1- dehydrogenase homology : fumarate reductase 
f lavoprotein homology] [OR:Bacillus subtilis] [EC : 1 . 3 . 99 . 1] [DB:pir2] 



NT AA 

ORF Name NTID AAID — , — , . score probability 
Length Length 



mg03000990_2B609540J:lJLfl5 | |18i7 | [ 5589 |pu~|| 79 |[73 1| 0.0084 



Description 

sp : [LN : YRUB_CLOPA] [AC:P23171] [OR : CLOSTRIDIUM PASTEURIANUM] [DE : (ORF B) ] [SP:P23171] 
[DB:swissprot] >pir : [LN: S29118] [AC:S29118 ] [PN: hypothetical protein 2] 
[OR Clostridium pasteurianum] [DB:pir2] >gp : [GI :gl44907] [LN : CLORUB] [AC :M60116] 
[OR : Clostridium pasteurianum] [SR : C .pasteurianum (strain ATCC 6013) DNA] 
[DB:genpept-bctl] [DE : C .pasteurianum open reading frame A, B, C, and rubredoxin 
gene, complete cds . ] [NT: open reading frame B] [LE:1126] [RE: 1353] [DI: direct] 



433 



ORF Name 



U 7 503000990 2bfo04b 7V cl 735 



NT ID 



NT 

AAID . 



AA 



Length Length 



score 



[5590 ||4b0~||i49 |pT" 



probability 
5.6e-29 



Description 

pir: [LN:C69874] [AC:C69874 ] [PN : conserved hypothetical protein ylbF] [GNiylbF ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e334774 : g2340002] [LN : BS16823KB] [AC:Z98682] 
[PN : YlbF protein] [GN: ylbF] [OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus 
subtilis genomic DNA 23.9kB fragment.] [LE: 13551] [RE: 14000] [DI : direct] 
>gp: [GI:ell85089:g2633870] [LN : BSUB0008] [AC : Z99111 : AL009126] [GN: ylbF] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [NT:similar to hypothetical proteins] [LE:173034] 
[RE : 173483] [DI : direct] 



ORF Name 



NTID 



^ m NT AA 

AAID — » — , , score 

LengthLength 



probability 



|AiVi>03000990 2W3b962 c3 1008 



] [1819 | [5591 | pg— 1| 161 |pn — 1| 6 . 9e-24 ^ 



Description 

gp: [GI:e244971:gl340128] [LN:SA1234] [AC:X97985] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE : S . aureus orfs 1,2,3 & 4.] [NT :ORFl] [LE:537] [RE: 1304] 
[DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



[AI7503000990 25816552 c3 1004 



] [1820 | [5592 |J777~|pBfl ||gg7 



score probability 
4.0e-99 



Description 

gp: [GI:g4009496] [LN : AF068904] [AC : AF068 904 ] [PN: YlmH] [GN:ylmH] [OR : Staphylococcus 
aureus] [DB : genpept -be t2 ] [DE : Staphylococcus aureus cell division protein FtsZ (ftsZ) 
gene, partial cds; YlmD (ylmD) , YlmE (ylmE) , YlmF (ylmF) , YlmG (ylmG) , and YlmH (ylmH) 
genes, complete cds; and cell division proteinDivIVA (divIVA) gene, partial cds.] 
[NT:similar to Bacillus subtilis YlmH] [LE:2865] [RE:3671] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503 



000990 25818811 c2 841 



J 



UT2T" 



[5593 |pT-]|50 ||162 |[5.ie-12 



Description 

gp: [GI:gl022725] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 
[DE : Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT : ORF2] 
[LE:394] [RE:1083] [D I : complement] >gp : [GI :g295162] [LN : STAMECRA] [AC:L14017] 
[OR staphylococcus aureus] [SR : Staphylococcus aureus (strain COL) DNA] 
[DB:genpept-bctl] [DE : Staphylococcus aureus methicillin-resistance protein (mecR) 
geneand unknown ORF, complete cds.] [NT:unknown ORF1; putative] [LE.-1492] [RE:2181] 
[DI : complement] 



ORF Name 



IA17503000990 25939030 12 399 



Description 
NO-HIT 



NT AA 
NTID AAID — _ — , - score 
LengthLength 



probability 



] [1822 | [5594 ||183 | p 



ORF Name 



IAI7503000990 25972207 tl 6 



Description 
NO-HIT 



NTID 



][ 



TF2T" 



AAID 



NT AA 
Length Length 

|5595 ||126 | |41 | 



score probability 
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ORF Name 



NT AA 
NTID AAID , — ^ T — , . score 
LengthLength 



probability 



IAI7503000990 25976401 ti 533 



] [1624 | [5596 ||1B0~||49 | 



Description 
NO-HIT 



ORF Name 



NTID 



„ TO NT AA 

AAID — , — score 

LengthLength 



probability 



^7503000990 25354550 ci 740 



192 I 157 



Description 
NO-HIT 



ORF Name 



NTID 



NT 

AAID 



AA 



LengthLength 



score 



probability 



^7503000990 26423305 ci 767 



] [1825 | [5598 ||9.3e-35 



Description 

gp: [GI:g4009495] [LN : AF068904] [AC : AF068904] [PN : YlmG] [GN:ylmG] [OR : Staphylococcus 
aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus cell division protein FtsZ (ftsZ) 
gene # partial cds; YlmD (ylmD) , YlmE (ylmE) , YlmF (ylmF) , YlmG (ylmG),and YlmH (ylmH) 
genes, complete cds; and cell division proteinDivIVA (divIVA) gene, partial cds.] 
[NT: similar to Bacillus subtilis YlmG] [LE:2492] [RE: 2782] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — T — , , score 
LengthLength 



probability 



E 



17503000990 26597077 c2 875 



"| [1827 | [5599 |pW] |465 | [1113 ||8 . 5e-il3 



Description 

gp: [GI :g2149896] [LN: SAU94706] [AC:U94706] [PN:cell division protein] [GN:divlB] 
[OR staphylococcus aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus strain ATCC 
8325-4 cell wall/cell divisiongene cluster, yllB, yllC, yllD, pbpA, mraY, murD, divlB, 
ftsA andftsZ genes, complete cds.] [LE:7179] [RE: 8498] [DI: direct] 



ORF Name 



NT AA 

NTID AAID _ — __ score 

LengthLength 



AI7503000990 26598402 c3 1011 



probability 
"] [1828 | [5500 |pTT||437 | p5T]|2 . 0e-i27 



Description 

sp : [LN : PYRP_BACCL] [AC:P41006] [GN: PYRP] [OR : BACILLUS CALDOL YT I CUS ] [DE : URACIL PERMEASE 
(URACIL TRANSPORTER)] [SP:P41006] [DB : swissprot] >pir : [LN : S3 88 93] [AC:S38893 ] 
[PN: uracil transport protein : uracil permease] [CL: uracil transport protein uraA] 
[OR:Bacillus caldolyticus] [DB:pir2] >gp : [GI :g431231] [LN: BCPYRQP] [AC:X76083] 
[PN: uracil permease] [GN:pyrP] [OR: Bacillus caldolyticus] [DB :genpept-bctl] 
[DE:B. caldolyticus (DSM405) pyrR, pyrP and pyrB (partial) genes.] [SP:P41006] [LE:1490] 
[RE:2788] [DI:direct] 



ORF Name 



NTID AAID — J , score 
LengthLength 



probability 



IA17503000990 25514167 ci 795 



] [1829 | [5501 ||2*q— 1|79 | p3~ ||4 . be-20 



] 



Description 

sp: [LN:ACP_BACSU] [AC : P80643 : P51832] [GN : ACPA : ACPP] [OR:BACILLUS SUBTILIS] [DE : ACYL 
CARRIER PROTEIN (ACP) ] [SP : P80643 : P51832 ] [DB : swissprot] >pir : [LN : JC4822 ] 
[AC : JC4822 : A69582 ] [PN:acyl carrier protein : 8 . 5K protein] [GN:acpA:srb ] [CL:acyl 
carrier protein:acyl carrier protein homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85183:g2633964] [LN : BSUB0009] [AC : Z 99112 : AL009126] [PN:acyl carrier protein] 
[GN : acpA] [FN: fatty acid biosynthesis] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT alternate gene name: acpP] [SP:P80643] [LE: 66316] [RE: 66549] [DI:direct] 
>gp: [GI:dl011632 :gl237013] [LN:D64116] [AC:D64116] [PN:ORF2] [GN:orf2] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis DNA] [DB : genpept-bctl] [DE:Bacillus subtilis genes for 
ORF1, ORF2, ORF3, ORF4 and Srb, partialand complete cds.] [LE:140] [RE: 373] [DI: direct] 
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ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



1AI7503000990 2682B187 rl 147 



1830 | [5502 |pT~]|167 | p7~ ||1 . le-14 



Description 

pir: [LN:B69873] [AC:B69873 ] [PN : hypothetical protein ylaL] [GN: ylaL ] [ORrBacillus 
subtilis] [DB:pir2] >gp: [GI : ell85072 : g2633853] [LN : BSUB0008 ] [AC : Z99111 : AL009126] 
[GN: ylaL] [FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis 
complete genome (section 8 of 21): from 1394791to 1603020.] [LE:155412] [RE:155897] 
[DI: complement] >gp: [GI : e324323 :g2224771] [LN : BSZ97025] [AC:Z97025] [GN: ylaL] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis nprE, 

yla[A,B,C,D,E,F,G,H,I, J,K,L,M,N,0] and pycAgenes . ] [LE:9671] [RE: 10156] [D I : complement] 



ORF Name 



NTID AAID — — , , score probab ility 



IM7S03000950 2751250 El 573 



TZJT 



Description 
NO-HIT 



NT AA 
LengthLength 
[5503 ||153 | [SIT 



ORF Name 



NTID AAID — , 



(AI7503000990 2766500 c3 938 



NT AA 
LengthLength 

|5604 ||153 | pi 



score 



[TTT 



probability 
|2.8e-06 



Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT :0RF1] 

[LE : 1101] [RE : 1922] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score 



503000990 2789aoi t2 435 



1933 | [5605 |pTS~[p^T 



probability 
1.2e-ll 



Description 

pir: [LN:D70070] [AC:D70070 ] [PN : transcription regulator homolog ywtF] [GN:ywtF ] 
[CL: Bacillus subtilis probable transcription regulator yvhJ] [OR: Bacillus subtilis] 
[DB:pir2] 



ORF Name 



NTID AAID — — score probability 



117503000990 2835285 t3 456 



Description 
NO-HIT 



NT AA 
LengthLength 
[5606 IP^T"] 



ORF Name 



NTID 



NT 



AA 



AAID v 

LengthLength 



score 



1A17503000990 29319086 cl 739 



] [1835 | [5607 ||5T5~~ ||172 |p^T" 



probability 
2.4e-44 



Description 

pir: [LN:F69874] [AC:F69874 ] [PN : lipopolysaccharide core biosynthesis homolog ylbl] 
[GN:ylbI ] [CL : lipopolysaccharide core biosynthesis protein kdtB] [OR: Bacillus 
subtilis] [DB:pir2] >gp : [GI : e334777 :g2340005] [LN : BS16823KB] [AC:Z98682] [PN:YlbI 
protein] [GN:ylbI] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis 
genomic DNA 23.9kB fragment.] [LE: 15208] [RE: 15693] [DI:direct] 

>gp: [GI:ell85092:g2633873] [LN : BSUBO 008] [AC : Z99111 : AL009126] [GN:ylbI] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [NT:similar to lipopolysaccharide core biosynthesis] 
[LE: 174691] [RE: 175176] [DI: direct] 



436 



ORF Name 



NT AA 
NT ID AAID — , T — _ score 
LengthLength 



probability 



IAI7503000990 29320217 t'l 271 



] [1835 | [5508 ||207 1 168 "| 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



E 



17503000990 25384818 c2 845 



] [1837 | [5609 IpTST] [TTg^ 



probability 
3900 110. 0 



Description 

pir: [LN:F69685] [AC:F69685 ] [PN:pyruvate carboxylase pycA] [GNrpycA ] [CL:pyruvate 
carboxylase :biot in carboxylase homology : lipoyl/biotin-binding homology] [OR: Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell85076 : g2633857] [LN: BSUB0008] [AC : Z99111 : AL009126] 
[PN:pyruvate carboxylase] [GN:pycA] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[EC:6.4.1.1] [DE:Bacillus subtilis complete genome (section 8 of 21): from 1394791to 
1603020.] [NT:alternate gene name: ylaP] [LE:158800] [RE:162246] [DI:direct] 



ORF Name 



NTID 



AAID 



IAI7503000990 29410908 c2 867 



TF31T 



NT AA 
LengthLength 
|5610 |P^~[ |42 I 



score probability 



Description 
NO-HIT 



ORF Name 



IAI7503000990 29488551 r2 398 



NTID 



11839 



7V7VTr , NT AA 

AAID — _ — , , score 

Length Length 



J [5611 jp53~| |77 Im- 



probability 
0.020 



Description 

gp: [GI:g453389] [LN : BCU05814] [AC:U05814] [PN : cytochrome C oxidase subunit III] 
[GN:COIII] [FN:electron transport] [OR : Kinetoplast Blastocrithidia culicis] 
[SR:Blastocrithidia culicis] [DB : genpept - invl ] [DE : Blastocrithidia culicis ATCC30268 
kinetoplast NADH dehydrogenase subunit 7 (ND7) gene, partial cds, and cytochrome C 
oxidase subunitlll (COIII) gene, complete cds.] [LE:223] [RE:1089] [DI:direct] 



ORF Name 



NTID 



IAI7503000990 30S5S300 c3 1030 



NT AA 

AAID 

LengthLength 

] [1840 | |5£i2 ||1353 | pu" 



score 



probability 
i.7e-9i 



Description 

sp: [LN:SUN_BACSU] [AC:P94464] [GN:SUN:FMU] [OR:BACILLUS SUBTILIS] [DE : SUN PROTEIN] 
[SP:P94464] [DB : swissprot] >pir : [LN : E6 9878] [AC:E69878 ] [PN:RNA- binding Sun protein 
homolog yloM] [GN:yloM ] [CL : hypothetical protein HI0624] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI :ell85165:g2 633946] [LN:BSUB0009] [AC : Z99112 : AL009126] [GN:yloM] 
[FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete 
genome (section 9 of 21): from 1598421to 1807200.] [NT:alternate gene name: sun; 
similar to RNA-binding] [SP:P94464] [LE:48918] [RE:50261] [DI:direct] 

>gp: [GI:e323504:g2337803] [LN: BSY13937] [AC:Y13937] [PN:putative Fmu protein] [GN : yloM] 
[FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis genomic DNA 
from the spoVM region.] [SP:P94464] [LE:10690] [RE:12033] [DI: direct] 



437 



NT AA 

ORFJJame NTID AA1D Le ~ thLe ~ th probability 

|Al7503000990_a066Ji)bb_c^B09 | [1841 | [ 5513 [[ liST ] pi ||7g7^j| a . 9e-7fe | 

Description 

sp: [LN:PURK_BACSU] [AC:P12045] [GN : PURK] [OR: BACILLUS SUBTILIS] [EC : 4 . 1 . 1 . 2 1] [DE : (AIR 
CARBOXYLASE) (AIRC) ] [SP:P12045] [DB : swissprot] >pir: [LNrDCBSPK] [AC : B29326 :G69684 ] 
[PN:phosphoribosylaminoimidazole carboxylase, carbon dioxide- fixation 
chain: phosphor ibosylaminoimidazole carboxylase chain II] [GN :purK ] 
[CL:phosphoribosylaminoimidazole carboxylase carbon dioxide- fixation 

chain :phosphoribosylaminoimidazole carboxylase carbon dioxide- fixation chain homology] 
[OR:Bacillus subtilis] [EC : 4 . 1 . 1 . 21] [DB:pirl] [MP: 18 min ] >gp : [GI :gl43365] 
[LN:BACPURF] [AC : J02 732 : K0004 7] [OR:Bacillus subtilis] [SR : B . subtilis (strain DEI 
(prototroph DER . or W168)) DNA, clone pPZ] [DB :genpept-bctl] [DE : B . subtilis pur operon 
encoding purine biosynthesis enzymes, 12genes . ] [NT : phosphor ibosyl aminoimidazole 
carboxylase II] [LE:920] [RE:2059] [DI:direct] >gp : [GI : ell82623 :g2632957] [LN : BSUB0004] 
[AC:Z99107:AL009126] [PN : phosphoribosylaminoimidazole carboxylase II] [GN:purK] 
[FN:purine biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 4 . 1 . 1 . 21] 
[DE:Bacillus subtilis complete genome (section 4 of 21): from 600701 to813890.] 
[SP:P12045] [LE:97902] [RE:99041] [DI: direct] 

NT AA 

ORF Name NTID AAID — „ — , , score prob ability 
LengthLength c L - 



IAI7503000990 30745580 t2 431 



[1842 | [5514 |pTO7-|| 6g8 | pT75~[| 2 . 3e-2^6 — [ 
Description 

sp: [LN:Q0X1_BACSU] [AC:P34956] [GN :QOXB : IPA-38D] [OR:BACILLUS SUBTILIS] [EC:1.9.3.-] 
[DE:SXJBUNIT QOXB) (OXIDASE AA(3) SUBUNIT 1)] [SP:P34956] [DB : swissprot] 
>pir: [LN:B38129] [AC : B3812 9 : S3 96 93 : F6 968 7 ] [PN:bo-type ubiquinol oxidase, chain 
I cytochrome aa3 quinol oxidase chain I:quinol oxidase aa3-600] [GN: qoxB ] 
[CL : cytochrome -c oxidase chain I : cytochrome -c oxidase chain I homology] [OR:Bacillus 
subtilis] [EC:1.10.3.-] [DB:pir2] >gp : [GI :gl43397] [LN:BACQOXA] [AC:M86548] [PN:quinol 
oxidase] [GN: QOXB] [OR:Bacillus subtilis] [SR:Bacillus subtilis DNA] [DB : genpept-bctl] 
[DE:Bacillus subtilis AA3-600 quinol oxidase (QOXA, QOXB, QOXC, QOXD)genes, complete 
Cds.] [LE:1846] [RE:3795] [DI:direct] >gp : [GI : g413962] [LN:BSGENR] [AC:X73124] 
[GN:ipa-38d qoxB] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis genomic 
region (325 to 333).] [SP:P34956] [LE:40298] [RE:42247] [DI: direct] 

>gp: [GI:ell86315:g2636351] [LN: BSUB0020] [AC : Z99123 : AL009126] [PN : cytochrome aa3 quinol 
oxidase (subunit I)] [GN : qoxB] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 20 of 21): from 3798401to 4010550.] [NT : alternate 
gene name: ipa-38d] [SP:P34956] [LE: 115715] [RE: 117664] [DI : complement] 

NT AA 

ORF Name NTID AAID — — score probability 



AI7503000990JU272062J:i_i83 

Description 

NO-HIT 



LengthLength - 
[1843 | | 56i5 |p7T~[| 55 | 
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ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



IA17503000990 32242890 r2 397 



1844 | [5616 |[T7uT]|566 | [1986 ||2 . 6e-"?u3~ 



] 



Description 

Sp: [LN : YKQC_BACSU] [AC:Q45493] [GN: YKQC] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 61.5 
KD PROTEIN IN ADEC-PDHA INTERGENIC REGION] [SP:Q45493] [DB : swissprot] >pir : [LN : B6 9862] 
[AC:B6 9862 ] [PN : conserved hypothetical protein ykqC] [GN:ykqC ] [CL : conserved 
hypothetical protein MG139] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :ell85043 :g2633824] 
[LN:BSUB0008] [AC : Z99111 : AL009126] [GN:ykqC] [FN:unknown] [ORrBacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 8 of 21) : from 
1394791to 1603020.] [NTisimilar to hypothetical proteins] [SP:Q45493] [LE:127733] 
[RE:129400] [DI : complement] >gp : [GI :g3282138] [LN: AF012285] 

[AC:AF012285:AF012284 :U51911] [PNrunknown] [GN:ykqC] [FNrunknown] [ORrBacillus 
subtilis] [DB:genpept-bct2] [DE:Bacillus subtilis mobA-nprE gene region.] [NTisimilar 
to aminopeptidase AMPS with Swiss-Prot] [LE: 28221] [RE: 29888] [DI : complement] 



ORF Name 



IAI7503000990 32657138 ti 155 



Description 
NO-HIT 



NTID 



_ _ — NT AA 

AAID — — , score 

Length Length 



probability 



] [1845 | [5617 l|m— | |58 | 



ORF Name 



NTID 



IA17503000990 32756 13 601 



„ T „ NT AA 
AAID — — , score 
Length Length 

[5618 |p¥uT]|46g ||748 ||4.0e-74 



probability 



Description 

pir : [LN:S62667] [AC:S62667 ] [PN:Nrampl protein] [CLtnatural resistance-associated 
macrophage protein 1] [OR:Oryza sativa] [SR:, rice] [DB:pir2] 



ORF Name 



NTID 



™ Tr , NT AA 

AAID — — , , score 

Length Length 



probability 



AI7503000990_3314i28_r2_238 

Description 

NO-HIT 



TttT 



[5619 | 



ORF Name 



NT AA 
NTID AAID — _ — , , score 
LengthLength 



1A17503000990 33153 c2 834 



] 



|5620 ||900 1 |299 | [^T" 



probability 
5.8e-43 



Description 

gp: [GI :g4981938] [LN: AE001791] [AC : AE001791 : AE000512] [PN : spermidine /putrescine ABC 
transporter, permease] [GN:TM1377] [OR : Thermo toga maritima] [DB : genpept-bct2] 
[DE : Thermotoga maritima section 103 of 136 of the complete genome.] [NT: similar to 
GB:L42023 SP:P45170 PID:1007357] [LE:8884] [RE:9672] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



Al7503000990_33367325_ci_706 

Description 

NO-HIT 



TttT 



|5621 |55 | 



439 



ORF Name 

1A17B03000990 33b94187 cl 728 



NT AA 
NT ID AAID — , — , , scor e 
LengthLength 



probability 



] [1850 | [5522 |prr"||100 |flbO ||7.1e-Hr 



Description 

gp: [GI :g2668605] [LN: AF015453] [AC : AF015453 ] [PN:unknown] [OR : Lactobacillus rhamnosus] 
[DB:genpept-bct2] [DE : Lactobacillus rhamnosus 6-phospho-beta-glucosidase homolog 
gene, partial cds; GNTR transcriptional regulator homolog and surf acelocated protein 
genes, complete cds.] [NT : 3 . 0E-ORF- 1] [LE:2236] [RE:>3603] [DI : complement] 



ORF Name 



LAI7S03000990 33595178 t3 475 



Description 
NO-HIT 



NT ID 



[1851 | 



AAID 



NT AA 
LengthLength 
TF3 



score 



probability 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



IAI7B03000990 33651636 c3 981 



[5624 |p77F]|591 |[T55F 



probability 
9.0e-157 



Description 

pir: [LN:C69985] [AC:C69985 ] [PN:probable DNA- dependent DNA polymerase beta chain yshC] 
[GN:yshC ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI :ell84108 :g2635324] [LN: BSUB0015] 
[AC:Z99118 :AL009126] [GN:yshC] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[NT: similar to DNA polymerase beta] [LE : 127286] [RE: 128998] [DI : complement] 
>gp: [GI:ell65329:gl770035] [LN : BSZ75208] [AC:Z75208] [PN : hypothetical protein] 
[GN:yshC] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis genomic sequence 
89009bp.] [NT:unknown function /putative] [LE:40909] [RE:42621] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



I&I7S03000&90 34017812 t3 480 



] 



T8TT 



[5625 |p772"[|573 | p3^]|6 . 3e-140 



Description 

pir: [LN:G69877] [AC:G69877 ] [PN : f ibronectin-binding protein homolog yloA] [GN : yloA ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e332190 :g2462963] [LN : BSPYREYLO] [AC : AJ000974] 
[PN:putative f ibronectin-binding protein] [GN:yloA] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis pyrE to yloA gene region.] [NT:protein A-like] 
[LE:6297] [RE:8015] [DI : complement] >gp: [GI : ell85156 :g2633937] [LN : BSUB0009] 
[AC:Z99112 :AL009126] [GN:yloA] [FN:unknown] [OR:Bacillus subtilis] [DB: genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT: similar to f ibronectin-binding protein] [LE: 37110] [RE: 38828] [DI : complement] 



ORF Name 

|AI7503000 990_34100626_r3_494 

Description 

NO-HIT 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



[5626 [|159 1 152 | 
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NT AA 

0RF Name NTID AAIP Le^thLe^th ^^ probability 



17b03000990_3417bk85_ciJ>86 | [1855 | [5627 |[5T1~| |170 ||b02 ||4.7e-48 



Description 

sp: [LN:PUR6_BACSU] [AC:P12044] [GN : PURE] [OR : BACILLUS SUBTILIS] [EC : 4 . 1 . 1 . 21] [DE:(EC 
4.1.1.21) (AIR CARBOXYLASE) (AIRC) ] [SP:P12044] [DB : swissprot] >pir : [LN:DEBSPE] 
[AC:A29326 :D69684 ] [PN : phosphoribosylaminoimidazole carboxylase, catalytic 
chain phosphoribosylaminoimidazole carboxylase chain I] [GN:purE ] 
[CL: phosphoribosylaminoimidazole carboxylase catalytic 

chain: phosphoribosylaminoimidazole carboxylase catalytic chain homology] [OR: Bacillus 
subtilis] [EC:4.1.1.21] [DBrpirl] [MP:18min] >gp : [GI :gl43364] [LN : BACPURF] 
[AC: J02732 :K00047] [OR:Bacillus subtilis] [SR : B . subtilis (strain DEI (prototroph DER. 
or W168)) DNA, clone pPZ] [DB : genpept-bctl] [DE :B. subtilis pur operon encoding purine 
biosynthesis enzymes, 12genes.] [NT : phosphoribosyl aminoimidazole carboxylase I] 
[LE:439] [RE:927] [DI: direct] >gp : [GI : el 182622 :g2 632956] [LN : BSUB0004] 
[AC : Z99107 : AL009126] [PN: phosphoribosylaminoimidazole carboxylase I] [GN :purE] 
[FN:purine biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 4 . 1 . 1 . 21] 
[DE:Bacillus subtilis complete genome (section 4 of 21): from 600701 to813890.] 
[SP:P12044] [LE:97421] [RE:97909] [DI:direct] 

NT AA 

ORF Name NTID AAID — _„ — , , score probab ility 
LengthLength c L - 



1AI7503000990 344077S0 F2 358 



1855 | [5528 ||152~| p ||119 ||1.8e-07 



Description 

pir : [LN:G69872] [AC:G69872 ] [PN : hypothetical protein ylal] [GN:ylaI ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell85069:g2 633850] [LN:BSUB0008] [AC : 299111 :AL009126] 
[GN:ylaI] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 8 of 21) : from 1394791to 1603020.] [LE:153004] [RE:153213] 
[DI complement] >gp: [GI : e324322 :g2224768] [LN: BSZ97025] [AC:Z97025] [GN:ylaI] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis nprE, 

yla [A,B,C,D,E,F,G,H, I,J,K,L,M,N,0] and pycAgenes . ] [LE:7263] [RE:7472] [DI : complement] 

NT AA 

ORF Name NTID AAID — — , , score probability 
LengthLength c 



[A:nS03000930_34542213_c2_877 | [1857 | [5529 |204 ||58i ||5.ie-57 



Description 

gp: [GI:g4009494] [LN : AF068904] [AC : AF068904] [PN : YlmF] [GN:ylmF] [OR : Staphylococcus 
aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus cell division protein FtsZ (ftsZ) 
gene, partial cds; YlmD (ylmD) , YlmE (ylmE) , YlmF (ylmF) , YlmG (ylmG) , and YlmH (ylmH) 
genes, complete cds; and cell division proteinDivIVA (divIVA) gene, partial cds.] 
[NT:similar to Bacillus subtilis YlmF] [LE:1926] [RE:2480] [DI:direct] 

NT AA 

ORF Name NTID AAID — — . , score probability 
LengthLength c 



IAI7503000990 3454412b c2 875 



1858 | [5530 |prrr||471 ||T732-||9.Se-185 



Description 

Sp : [LN : FTSA_STAAU] [AC:O07325] [GN : FTSA] [OR : STAPHYLOCOCCUS AUREUS] [DE : CELL DIVISION 
PROTEIN FTS A] [SP:O07325] [DB : swissprot] >gp : [GI :g2149897] [LN: SAU94706] [AC:U94706] 
[PN:cell division protein] [GN: f tsA] [OR : Staphylococcus aureus] [DB : genpept -be t2 ] 
[DE: Staphylococcus aureus strain ATCC 8325-4 cell wall/cell divisiongene cluster, yllB, 
yllC, yllD, pbpA, mraY, murD, divlB, ftsA andftsZ genes, complete cds.] [LE:8604] 
[RE : 10019] [DI : direct] 



441 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score 



|A17503000990_3464693V_c3^983 



J [1859 | [5531 ||33S | [ill ] pTZ~ 



probability 
4 . le-49 



Description 

gp: [GI:el333201:g3776111] [LN : SATRXA] [AC : AJ223480] [PN : thioredoxin] [GN : trxA] 
[FN: thiol : disulfide interchange] [OR : Staphylococcus aureus] [DB :genpept-bctl] 
[DE : Staphylococcus aureus trxA and uvrC genes and partial mutS and dhsCgenes J 
[LE:2334] [RE:2648] [DI:direct] 



ORF Name 



NTID AAID „ — ^ — 



[AI7503000990^34650452_ci_782 



NT AA 
LengthLength 



score 



IT 



probability 
0.00078 



Description 

gp: [GI:g488925] [LN:A13473] [AC:A13473] [PN: 4 lkd antigen] [OR : Plasmodium falciparum] 
[SR:malaria parasite P. falciparum] [DB :genpept-pat] [DE : P . falciparum gene for 41kd 
antigen, clone 41-14.] [LE:<1] [RE:>532] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID , — _ 

LengthLength 



score 



IM7503000990 35285902 c2 859 



] [1851 | [5633 [pry— ]|49 |p | f 



probability 
UHJ^ 



Description 

sp: [LN:PGH2_MUSVI] [AC:062725] [GN : PTGS2 : COX2] [OR :MUSTELA VISON] [SR : , AMERICAN MINK] 
[EC:1.14.99.1] [DE : SYNTHASE 2) (PGH SYNTHASE 2) (PGHS-2) (PHS II)] [SP:062725] 
[DB:swissprot] >gp : [GI : g2959708] [LN: AF047841] [AC : AF047841] [PN prostaglandin synthase 
2] [OR:Mustela vison] [SR: American mink] [DB : genpept -mam] [DE:Mustela vison 
prostaglandin synthase 2 mRNA, complete cds . ] [NT: cyclooxygenase 2] [LE:11] [RE: 1825] 
[DI : direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



E 



17503000990 3b3427 t2 302 



Description 
NO-HIT 
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ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 
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Description 
NO-HIT 



] [1863 | [5635 p~ 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IA17503000990 35947191 c2 835 



[5636 |pT^"||384 |[TT9" 



probability 
6.3e-07 



Description 

gp: [GI:gl633572] [LN:KSU52064] [AC:U52064] [OR:Kaposi's sarcoma-associated herpesvirus] 
[SR:Kaposi's sarcoma-associated herpesvirus - Human herpesvirus 8] [DB : genpept -vrl] 
[DE:Kaposi's sarcoma-associated herpes-like virus ORF73 homolog gene, complete cds.] 
[NT herpesvirus saimiri ORF73 homolog] [LE:1] [RE:3489] [DI:direct] >gp : [GI :gl718329] 
[LN:KSU75698] [AC:U75698] [OR:Kaposi's sarcoma-associated herpesvirus] [SR:Kaposi»s 
sarcoma-associated herpesvirus - Human herpesvirus 8] [DB : genpept -vrl] [DE:Kaposi's 
sarcoma-associated herpesvirus long unique region, 80putative ORF's and kaposin gene, 
complete cds.] [NT : ORF 73; extensive acidic domains, potential leucine] [LE:123809] 
[RE: 127297] [DI : complement] 



442 



ORF Name 

|Al 7 b03000990 35978392 ci 795 



NT 



AA 



NTID AAID Le - thLe - th g£ore probability 



[1865 | [5537 j|2007 | |66B |p49 p.Oe-^" 



Description 

pir: [LN:H69878] [AC:H69878 ] [PNiprotein kinase homolog yloP] [GNryloP ] [CL:protein 
kinase homology] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85168 : g2633949] 
[LN:BSUB0009] [AC : Z99112 : AL009126] [GNryloP] [FN:unknown] [ORrBacillus subtilis] 
[DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 9 of 21) : from 
1598421to 1807200.] [NTrsimilar to protein kinase] [LE:52121] [RE:54067] [DI:direct] 
>gp: [GI:e323506:g2337806] [LN : BSY13 93 7] [AC:Y13937] [PN:putative Pkn2 protein] 
[GN:yloP] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
genomic DNA from the spoVM region.] [LE: 13893] [RE: 15839] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — . T — score 
LengthLength 



probability 



U7503000990 35980062 12 357 



Description 
NO-HIT 
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NTID 



-- T _ NT AA 
AAID — — score 
LengthLength 



F 
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probability 
] [1867 | [5539 ||288~| |95 ||408 [|4.3e-38 



Description 

sp: [LN : PTHP_STACA] [AC:P23534] [GNrPTSH] [OR : STAPHYLOCOCCUS CARNOSUS] 
[DE : PHOSPHOCARRIER PROTEIN HPR ( HI STIDINE -CONTAINING PROTEIN)] [SP:P23534] 
[DB:swissprot] >pir : [LN:A42374] [AC : S15367 : A42374 ] [PN : phosphotransferase system 
phosphohistidine- containing protein : phosphotransferase system HPr] 

[CL: phosphotransferase system phosphohistidine- containing protein phosphotransferase 
system phosphohistidine- containing protein homology] [OR : Staphylococcus carnosus] 
[DB:pirl] >gp: [GI:g46908] [LN:SCPTSH] [AC:X60766] [PN : Histidine-containing protein 
(HPr)] [GN:ptsH] [OR : Staphylococcus carnosus] [DB :genpept-bctl] [DE : S . carnosus ptsH 
gene for histidine-containing protein (HPr).] [SP:P23534] [LE:270] [RE:536] [DI:direct] 



ORF Name 

IAI7503000990 35141893 c3 1039 



NT 



AA 



NTID AAID , , _ 



LengthLength 



score 



][ 



probability 
[5640 ||IZW||4i5 — ||±±6g |p.ie-ii8 



Description 

gp: [GI:ell85186:g2633967] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN:signal recognition 
particle (docking protein)] [GN:ftsY] [FN:involved in secretion of extracellular 
proteins] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 9 of 21): from 1598421to 1807200.] [NT : alternate gene name: srb] 
[SP:P51835] [LE:71119] [RE:72108] [Dlrdirect] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



IAI7B03000990 36142817 cl 717 
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probability 
7 i le-93 



Description 

pir: [LN:A70180] [AC:A70180 ] [PN : spermidine/putrescine ABC transporter, ATP-binding 
protein (potA) homolog] [CL: ATP-binding cassette homology] [OR:Borrelia burgdorferi] 
[SR:, Lyme disease spirochete] [DB:pir2] >gp : [GI :g2688562] [LN: AE001165] 
[AC:AE001165:AE000783] [PN : spermidine/putrescine ABC transporter,] [GN:BB0642] 
[OR:Borrelia burgdorferi] [SR:Lyme disease spirochete] [DB : genpep t - bet 2 ] [DE:Borrelia 
burgdorferi (section 51 of 70) of the complete genome.] [NT: similar to GB:M64519 
SP:P23858 PID:147326 GB:U00096] [LE:11911] [RE:12954] [D I : complement] 



443 



NT AA 

ORFJiame NTID AAID Le — thLe — th score probability 
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Description 

pir: [LN:C69986] [AC:C69986 ] [PN : conserved hypothetical protein ysnA] [GNrysnA ] 
[CL:Methanococcus jannaschii conserved hypothetical protein MJ0226] [OR:Bacillus 

subtilis] [DB:pir2] >gp: [GI : ell84085 :g2635301] [LN : BSUB0015] [AC : Z99118 : AL009126] 
[GNrysnA] [ FN : unknown ] [OR:Bacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis 

complete genome (section 15 of 21): from 2795131to 3013540.] [NT:similar to 

hypothetical proteins] [LE: 104515] [RE: 105111] [DI : complement] 

>gp: [GI:ell65357:gl770060] [LN : BSZ75208] [AC:Z75208] [PN: hypothetical protein] 
[GN : ysnA] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE :B . subtilis genomic sequence 
89009bp.] [NT:homology to ORFol97 of Escherichia coli; unknown] [LE:64796] [RE:65392] 
[DI: direct] 

NT AA 

ORF Name NTID AAID — _ — , , score probability 

LengthLength c L 



IAI7503000990 36219187 c3 920 1 [TSTT 
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Description 

sp: [LN:PURL_BACSU] [AC:P12042] [GN : PURL] [OR:BACILLUS SUBTILIS] [EC:6.3.5.3] 
[DE : SYNTHASE II)] [SP:P12042] [DB : swissprot ] >pir : [LN : SYBS2G] [AC : G2 9326 : C6 9685 ] 
[PN : phosphor ibosylf ormylglycinamidine synthase, component II : f ormylglycinamide ribotide 
amidotransf erase] [GN:purL ] [CL : phosphor ibosylf ormylglycinamidine synthase component 
II] [OR:Bacillus subtilis] [EC:6.3.5.3] [DB:pirl] [MP:18 min ] >gp : [GI :gl43369] 
[LN: BACPURF] [AC : J02732 : K00047] [OR:Bacillus subtilis] [SR : B . subtilis (strain DEI 
(prototroph DER. or W168) ) DNA, clone pPZ] [DB : genpept-bctl] [DE : B . subtilis pur operon 
encoding purine biosynthesis enzymes, 12genes . ] [NT : phosphor ibosylf ormyl glycinamidine 
synthetase II] [LE:5060] [RE:7288] [DI:direct] >gp : [GI : ell82628 :g2632962] [LN : BSUB0004] 
[AC:Z99107:AL009126] [PN:phosphoribosylf ormylglycinamidine synthetase I] [GN :purQ] 
[FN:purine biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:6.3.5.3] 
[DE:Bacillus subtilis complete genome (section 4 of 21): from 600701 to813890.] 
[SP:P12042] [LE:102042] [RE:104270] [DI:direct] 



NT AA 

ORF Name NTID AAID — . — , , score proba bility 
LengthLength u 

Ar7503000990_36445253_t2_320 | [1872 | | b644 |pT7— | |48 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



LengthLength - 

A!£7503000990_36g20302_c2_870 | [1873 | [ 5545 | p^~| | 62 || 250 || 2.4e-21 



Description 

sp : [LN : YLLB_STAAU] [AC:O07319] [GN : YLLB] [OR : STAPHYLOCOCCUS AUREUS] [DE : HYPOTHETICAL 
17.4 KD PROTEIN] [SP:O07319] [DB : swissprot] >gp : [GI : g2 149890] [LN : SAU94 706] [AC:U94706] 
[PN:unknown] [GN :yllB] [OR : Staphylococcus aureus] [DB :genpept-bct2] [DE : Staphylococcus 
aureus strain ATCC 8325-4 cell wall/cell divisiongene cluster, yllB, yllC, yllD, pbpA, 
mraY, murD, divlB, ftsA andftsZ genes, complete cds . ] [LE.-537] [RE: 971] [DI:direct] 



444 



ORF Name 



NT 



AA 



NTID AA!? Le - thLe ^ th ^2£e probability 



IA17503000990 36601687 ci 692 



][ 



TF7T" 



[ 5546 | | 1260| |419 | |IPTg-| |i . 6e-102 



Description 

sp: [LN:PUR2_BACSU] [AC:P12039] [GN: PURD] [OR:BACILLUS SUBTILIS] [EC : 6 . 3 . 4 . 13] 
[DE: RIBONUCLEOTIDE SYNTHETASE) (PHOSPHORIBOSYLGLYCINAMIDE SYNTHETASE)] [SP:P12039] 
[DB: swissprot] >pir: [LN: AJBSAG] [AC : B29183 : C69684 ] [PN : phosphor ibosylamine- -glycine 
ligase, : phosphor ibosylglycinamide synthetase] [GNrpurD ] 

[CL: phosphor ibosylamine- -glycine ligase : phosphor ibosylamine- -glycine ligase homology] 
[OR:Bacillus subtilis] [EC : 6 . 3 . 4 . 13] [DBrpirl] [MP:18 min ] >gp : [GI : g2465561] 
[LN:AF011544] [AC : AF011544 ] [PN : phosphor ibosylglycinamide synthetase] [GN:purD] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis 

phosphoribosylaminoimidazole-carboxamidef ormyltransf erase (purH-J) gene, partial 
cds,phosphoribosylglycinamide synthetase (purD) , YecA (yecA) , putativeadenine deaminase 
(yecB) , YecC (yecC) , and YecD (yecD) genes , complete cds, and putative glutamate 
synthase (yecE) gene, partialcds.] [NT : identified by complementation of E.coli TX267,] 
[LE:263] [RE:1531] [DI:direct] >gp : [GI : gl43374] [LN : BACPURF] [AC : J02732 : K00047] 
[OR:Bacillus subtilis] [SR :B . subtilis (strain DEI (prototroph DER. or W168) ) DNA, clone 
pPZ] [DB:genpept-bctl] [DE :B . subtilis pur operon encoding purine biosynthesis enzymes, 
12genes.] [NT : phosphor ibosyl glycinamide synthetase (PUR-D; gtg] [LE: 11971] [RE: 13239] 
[DI:direct] >gp : [GI : ell82633 :g2632967] [LN:BSUB0004] [AC : Z99107 : AL009126] 
[PN: phosphor ibosylglycinamide synthetase] [GN:purD] [FN: purine biosynthesis] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [EC : 6 . 3 . 4 . 13] [DE:Bacillus subtilis complete 
genome (section 4 of 21): from 600701 to813890.] [SP:P12039] [LE:108953] [RE:110221] 
[DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID — ^ — _ score probability 
Length Length c L - 



IAI 7 503000990 5912890 fl 463 



] [1875 | [5647 |p2~ [ |73 |[246 ||6.4e-21 



Description 

Sp : [LN : RL2 8_BACST] [AC:P23374] [GN : RPMB] [OR:BACILLUS STEAROTHERMOPHILUS] [DE:50S 
RIBOSOMAL PROTEIN L2 8] [SP:P23374] [DB : swissprot] >pir : [LN : A48396] [AC:A48396 ] 
[PN:ribosomal protein L28] [CL : Escherichia coli ribosomal protein L28] [OR:Bacillus 
stearothermophilus] [DB : pir2 ] 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IAI7503000990 3945257 r3 664 



] [1876 | [5648 |pT7F]p91 | |7T8— 1|4 . 5e-73 



Description 

pir: [LN:E69687] [AC : E69687 : A3 8 12 9 : S3 96 92 ] [PN : cytochrome aa3 quinol oxidase (subunit 
II) qoxA:quinol oxidase aa3-600 chain qoxA] [GN: qoxA ] [CL:bo-type ubiquinol oxidase 
chain II precursor: cytochrome-c oxidase chain II homology] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI:ell863 16 :g2 636352] [LN: BSUB0020] [AC : Z99123 : AL009126] [PN : cytochrome 
aa3 quinol oxidase (subunit II)] [GN : qoxA] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 20 of 21): from 3798401to 4010550.] 
[NT:alternate gene name: ipa-37d] [LE:117692] [RE:118657] [DI : complement] 



ORF Name 



IAI7503000990 3960881 t3 483 



Description 
NO-HIT 



NT AA 

NTID AAID , , score 

LengthLength 



][ 



IT5TT 



probability 



|5649 |P^~[ 



ORF Name 



E 



17503000990 39808 ±3 498 



Description 
NO-HIT 



NTID 



NT AA 
AAID — — , score 
LengthLength 



probability 



1878 | [ 5650 || 135 1 1 44 | 
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ORF Name 



NTID 



NT AA 
AAID — , « _ — , . score 
LengthLength 



probability 



LL7i>03000990 399219 3 13 495 



Description 
NO-HIT 



] [1679 | [5551 ||177 | [58 | 



ORF Name 



NTID 



_ NT AA 

AAID — _ — , . score 

LengthLength 



probability 



IAI7503000990 4067342 c3 977 



1880 | [5652 |prgTg^T| |3gj ||1267 ||4.ie-129 



Description 

sp: [LN:SYFA_BACSU] [AC : P17921 : P9453 9] [GNrPHES] [OR : BACILLUS SUBTILIS] [EC : 6 . 1 . 1 .20] 
[DE : -TRNA LIGASE ALPHA CHAIN) (PHERS) ] [SP : P17921 : P94 53 9] [DB : swissprot] 
>pir: [LN:YFBSA] [AC : H69675 : 140459 : S11730 ] [PN : phenylalanine tRNA ligase, alpha 
chain :phenylalanyl-tRNA synthetase alpha chain] [GNrpheS ] [CL phenylalanine- -tRNA 
ligase alpha chain] [ORrBacillus subtilis] [EC : 6 . 1 . 1 . 20] [DB:pirl] 

>gp: [GI:ell84113:g2635329] [LN :BSUB0015] [AC : Z99118 : AL009126] [PN :phenylalanyl-tRNA 
synthetase (alpha subunit)] [GNrpheS] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[EC:6.1.1.20] [DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 
3013540.] [SP:P17921] [LE:133410] [RE:134444] [DI : complement] 

>gp: [GI:ell65324:gl770030] [LN :BSZ75208] [AC:Z75208] [PN : phenylalany- tRNA synthetase 
beta subunit] [GN:pheS] [ORrBacillus subtilis] [DB :genpept-bctl] [EC : 6 . 1 . 1 . 20] 
[DE:B. subtilis genomic sequence 89009bp.] [NT :phenylalanyl-tRNA synthetase beta 
subunit] [SP:P17921] [LE:35463] [RE:36497] [DIrdirect] 



ORF Name 



xt^t^ _ _ NT AA 

NTID AAID — , — , score 

LengthLength 



probability 



IAI7503000990 4103438 c3 970 



J [1881 | [5553 ||2S5~| [84 | pTU~[|l . Se-05 



] 



Description 

pir: [LN:D69874] [AC:D69874 ] [PN : conserved hypothetical protein ylbG] [GN:ylbG ] 
[ORrBacillus subtilis] [DB:pir2] >gp : [GI : e334775 :g2340003] [LN: BS16823KB] [AC:Z98682] 
[PN: YlbG protein] [GNrylbG] [OR:Bacillus subtilis] [DB : genpept -bet 1 ] [DE:Bacillus 
subtilis genomic DNA 23.9kB fragment.] [LE: 14055] [RE: 14327] [DI: direct] 
>gp: [GI:ell85090 :g2633871] [LN: BSUB0008] [AC : Z99111 : AL009126] [GN : ylbG] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [NT:similar to hypothetical proteins] [LE:173538] 
[RE : 173810] [DI : direct] 



ORF Name 



NTID 



NT 



AA 



17503000990 4298177 cl 776 



Description 
NO-HIT 



^ Le^thLe^th 
[5554 ||12g | [41 | 



score probability 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000990 4300332 cl 700 



][ 



[5655 [[TT?F]|442 ||98 ||0. 00059 



Description 

gp: [GI:g5306148] [LN : AF160864] [AC : AF160864] [PN:orf365] [GN:orf365] [OR : Mitochondrion 
Tetrahymena pyriformis] [SR:Tetrahymena pyriformis] [DB: genpept] [DE : Tetrahymena 
pyriformis mitochondrial DNA, complete genome.] [NT:Open reading frame ymf65 (CPGN) ; 
ATA initiation] [LE : 15906] [RE: 17003] [DI : complement] 
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ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



1A1 7 503000990 43065^ cl 760 



J 



[5655 |p^~||101 |pT- 



probability 
l.be-44 



Description 

sp: [LN:YLLB_STAAU] [AC:O07319] [GN:YLLB] [OR : STAPHYLOCOCCUS AUREUS] [DE : HYPOTHETICAL 
17.4 KD PROTEIN] [SP:O07319] [DB : swissprot ] >gp : [GI :g2149890] [LN : SAU94 706] [AC:U94706] 
[PN:unknown] [GN:yllB] [OR : Staphylococcus aureus] [DB : genpept -bet 2 ] [DE : Staphylococcus 
aureus strain ATCC 8325-4 cell wall/cell divisiongene cluster, yllB, yllC, yllD, pbpA, 
mraY, murD, divlB, ftsA andftsZ genes, complete cds . ] [LE:537] [RE: 971] [DI : direct] 



ORF Name 



NTID 



NT AA 
AAID — _ — , , score 
LengthLength 



probability 



IAI7503000990 4335053 c3 934 



|5657 | [TT 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



LAI7503000990 4457788 t2 344 



[5658 ||17U|pr 



probability 
5.3e-2I 



Description 

pir: [LN:D71623] [AC:D71623 ] [PN : erythrocyte membrane protein PfEMP3 PFB0095c] 
[GN:PFB0095c ] [OR : Plasmodium falciparum] [DB:pir2] >gp : [GI :g3845093] [LN: AE001371] 
[AC:AE001371:AE001362] [PN : erythrocyte membrane protein PfEMP3] [GN : PFB0095c] 
[OR Plasmodium falciparum] [SR: malaria parasite P. falciparum] [DB : genpept -inv2] 
[DE : Plasmodium falciparum chromosome 2, section 8 of 73 of the completesequence . ] 
[NT identified by sequence similarity] [LE : 1407 : 8817] [RE : 8621 : 8927] 
[DI : complement Join] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



E 



17503000990 4509651 cl 783 



1887 | [5659 [pi— 1|136 ||16B ||1.2e-12 



Description 

gp: [GI:el453986 :g4835313] [LN:SC5H1] [AC : AL049863] [PN : hypothetical protein] 
[GN:SC5H1 . 10c] [OR : Streptomyces coelicolor] [DB : genpept -bet 1] [DE : Strep tomyces 
coelicolor cosmid 5H1.] [NT : SC5H1 . 10c, hypothetical protein, len: 160 aa;] [LE:8093] 
[RE : 8575] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IA17503000990 4532957 c3 982 



[5660 |p53~||785 |piS5||0.0 



J 



Description 

gp: [GI:el333200:g3776110] [LN : SATRXA] [AC : AJ223480] [PN:MutS-like protein] [GN:mutS] 
[ FN : DNA mismatch repair protein] [OR : Staphylococcus aureus] [DB : genpept -bet 1] 
[DE staphylococcus aureus trxA and uvrC genes and partial mutS and dhsCgenes.] [LE:<1] 
[RE:2161] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID _ — . , . 



LengthLength 



score 



[A17503000990 4696068 cl 741 



| 566i ||186 ~|| 6i |pT 



probability 
5.7e-20 



Description 

sp: [LN:RL32_BACST] [AC:P07840] [GNrRPMF] [OR:BACILLUS S TE AROTHERMO PH I LUS ] [DE:50S 
RIBOSOMAL PROTEIN L32 (RIBOSOMAL PROTEIN I) (BL37) ] [SP:P07840] [DB : swissprot] 
>pir: [LN:R5BS37] [AC:S07236 ] [PN : ribosomal protein L32 : ribosomal protein 
BL37 : ribosomal protein I] [CL : Escherichia coli ribosomal protein L32] [OR: Bacillus 
stearothermophilus] [DB:pirl] 
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NT AA 

ORF Na me NTID AAID — „ — , , score prob ability 
Length Length JL 

|Al7503Q0099()_472b000_cl_797 | [1590 | [5662 || 3636 | |1211 | p42fe | |6 . 2e-'Ab2 

Description 

pir: [LN:G69708] [AC : G69708 : JC4819 : PC4 02 9 ] [PN : chromosome segregation SMC 
protein rminichromosome stabilizing protein SMC] [GNrsmc ] [CL : conserved hypothetical 
P115 protein] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell85185 :g2633966] [LN : BSUB0009] 
[AC : Z99112 : AL009126] [PN : chromosome segregation SMC protein homolg] [GN:smc] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT : alternate gene name: ylqA] [LE:67539] [RE:71099] 
[DI : direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



17503000990 4725415 c2 873 1 11891 I 15553 112337 I I77B 1 13205 110. 0 



Description 

gp: [GI:dl023422:g2463561] [LN : AB007500] [AC : ABO 075 00 : D28879] [PN:penicillin-binding 
protein 1] [GN : pbpA] [OR : Staphylococcus aureus] [SR : Staphylococcus aureus 

(strain :NCTC8325) DNA] [DB :genpept-bctl] [DE : Staphylococcus aureus genes for 
penicillin-binding protein 1, MraY,MurD, partial and complete cds . ] [LE:594] [RE: 2828] 

[DI:direct] >gp : [GI :g2149893] [LN : SAU94 706] [AC:U94706] [PN : penicillin-binding protein 
1] [GN:pbpA] [OR staphylococcus aureus] [DB : genpept -bet 2 ] [DE : Staphylococcus aureus 
strain ATCC 8325-4 cell wall/cell divisiongene cluster, yllB, yllC, yllD, pbpA, mraY, 
murD, divlB, ftsA andftsZ genes, complete cds.] [LE:2318] [RE:4552] [DI:direct] 



NT AA 

ORF Name NTID AAID — — ^ score probability 
LengthLength • L - 



A17503000990_47410i0_c3_956 | [1892 | |b664 |p?5~| |281 ||550 ||9.8e-64 



Description 

sp: [LN:SUHB_BACSU] [AC:Q45499] [GN : SUHB] [OR:BACILLUS SUBTILIS] [DE : EXTRAGENIC 
SUPPRESSOR PROTEIN SUHB HOMOLOG] [SP:Q45499] [DB : swissprot] >pir : [LN:E69864] [AC:E69864 
] [PN :myo- inositol -1 (or 4 ) -monophosphatase homolog yktC] [GN:yktC ] [CL : suppressor 
protein suhB] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e!185057 :g2633838] [LN: BSUB0008] 
[AC:Z99111:AL009126] [GN:yktC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 8 of 21): from 1394791to 1603020.] 
[NT:similar to myo- inositol -1 (or 4) -monophosphatase] [SP:Q45499] [LE:142056] 
[RE:142853] [DI:direct] >gp: [GI :g3282150] [LN : AF012285] [AC : AF012285 : AF012284 :U51911] 
[PN:unknown] [GN:yktC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bct2] 
[DE:Bacillus subtilis mobA-nprE gene region.] [NT:similar to E. coli extragenic 
suppressor protein] [LE: 42544] [RE: 43341] [DI: direct] 



NT AA 

ORF Name NTID AAID — ^ T — __ score probability 
LengthLength c L - 



5.3e-223 



[Al7b03000990_4741703_c2_894 | p^T 

Description 

sp: [LN : PRIA_BACSU] [AC : P94461 : 034 941] [GN : PRIA] [OR:BACILLUS SUBTILIS] [DE : PRIMOSOMAL 
PROTEIN N» (REPLICATION FACTOR Y) ] [SP : P94461 : 034941] [DB : swissprot] >pir : [LN : A69682] 
[AC:A69682 ] [PN : primosomal replication factor Y priA] [GN:priA ] [CL :unassigned DEAD/H 
box helicases : DEAD/H box helicase homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85162:g2633943] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN : primosomal 
replication factor Y (primosomal] [GN :priA] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:alternate gene name: yloJ] [SP:P94461] [LE:45047] [RE:47464] [DI:direct] 
>gp: [GI:e323502 :g2337800] [LN: BSY13937] [AC:Y13937] [PN:putative PriA protein] 
[GN:yloJ] [FN:unknovm] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
genomic DNA from the spoVM region.] [SP:P94461] [LE:6819] [RE:9236] [DI:direct] 
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ORF Name 



NT AA 

NTID AAID — _ T .score 

LengthLength 



probability 



U7503000990 4769512 t2 349 



Description 
NO-HIT 



] [1894 | [5656 ||iib~[[44 | 



ORF Name 



117503000990 4781517 C2 888 



NTID 



NT AA 
AAID — , » _ — . , score 
LengthLength 



probability 



TEST 



TT9"" 



[52" 



Description 
NO-HIT 



ORF Name 



E 



17503000990 4859425 t2 275 



Description 
NO-HIT 



NT AA 
N TID AAID — _ — _ score 
LengthLength 

|iB96 | [5668 |pi~||46 | 



probability 



ORF Name 



NTID 



NT AA 
AAID — t — , , score 
LengthLength 



probability 



IAI7503000990 4875055 EI 23 



[1897 | |55g9 |p5~ ||74 |[74 ||0.0iS 



Description 

sp: [LN:Y27B_METJA] [AC:P81234] [GN: MJ0275 . 1] [OR : METHANOCOCCUS JANNASCHII] 
[DE: HYPOTHETICAL PROTEIN MJ02 75 . 1] [SP:P81234] [DB : swissprot] >gp : [GI : g2826269] 
[LN:U67482] [AC : U67482 : L77117] [PN:M. jannaschii predicted coding region MJ0275.1] 
[GN:MJ0275 . 1] [OR : Methanococcus jannaschii] [DB : genpept-bct2] [DE :Methanococcus 
jannaschii section 24 of 150 of the complete genome.] [NT:Brute Force ORF; identified 
by GeneMark; putative] [LE:9344] [RE: 9790] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — , — , score 
Length Length 



probability 



IA17503000990 4875327 t3 587 



Description 
NO-HIT 



] [1898 | [5670 ||135 1 144 | 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



IA17503000990 4875452 c3 1034 



][ 



TEW 



[5671 |p3~||214 ||521 ||4.6e-B0 



Description 

gp: [GI:g4982295] [LN : AE001811] [AC : AE001811 : AE000512] [PN : ribulose -phosphate 
3-epimerase] [GN:TM1718] [OR : Thermo toga maritima] [DB : genpept-bct2] [DE : Thermo toga 
maritima section 123 of 136 of the complete genome.] [NT: similar to GB:L77117 SP:Q58093 
PID:1591395 percent] [LE:2607] [RE:3269] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID . T . _ 

LengthLength 



score 



probability 



IAI7503000990 4876542 ti 175 



1900 | [5672 |p3~||210 | pi~||3 . Oe-53 



Description 

pir : [LN:D69862] [AC:D69862 ] [PN : f ormylmethionine deformylase homolog ykrB] [GN:ykrB ] 
[CL: polypeptide deformylase] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI :el!85046 :g2633827] [LN : BSUB0008 ] [AC : Z99111 : AL009126] [GN:ykrB] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [NT:similar to f ormylmethionine deformylase] 
[LE:130810] [RE:131364] [DI : complement] >gp : [GI :g3282140] [LN: AF012285] 
[AC:AF012285:AF012284 :U51911] [PN:unknown] [GN:ykrB] [FN:unknown] [OR:Bacillus 
subtilis] [DB:genpept-bct2] [DE:Bacillus subtilis mobA-nprE gene region.] [NT:similar 
to polypeptide deformylase from Mycoplasma] [LE: 31298] [RE: 31852] [DI : complement] 



449 



ORF Name 



NT ID 



NT AA 
AAID — «^ _ — . score 
LengthLength 



probability 



|A1 7 B03000990_4B , m03_clJ793 



] [1901 | [5573 [|3ll ~|p6i ||5.7e-55 



Description 

pir: [LN:H69620] [AC:H69620 ] [PNimalonyl CoA-acyl carrier protein transacylase fabD] 
[GN:fabD ] [CL: [acyl-carrier-protein] S-malonyltransf erase : [acyl -carrier-protein] 
S-malonyltransf erase homology] [ORrBacillus subtilis] [DB:pir2] 

>gp: [GI:ell85181:g2633962] [LN : BSUB0009] [AC : Z99112 : AL009126] [PNimalonyl CoA-acyl 
carrier protein transacylase] [GNrfabD] [FN: fatty acid biosynthesis] [OR: Bacillus 
subtilis] [DB:genpept-bctl] [EC : 2 . 3 . 1 .39] [DE:Bacillus subtilis complete genome 
(section 9 of 21): from 1598421to 1807200.] [NT : alternate gene name: ylpE] [LE:64546] 
[RE:65499] [DI:direct] >gp: [GI : e323514 :g2337819] [LN: BSY13937] [AC:Y13937] [PN:putative 
FabD protein] [GN: ylpE] [FN: unknown] [OR: Bacillus subtilis] [DB :genpept-bctl] 
[DE: Bacillus subtilis genomic DNA from the spoVM region.] [LE: 26318] [RE: 27271] 
[DI : direct] 



ORF Name 



NT 



AA 



NTID AAID — , T — _ 
LengthLength 



score 



|A1750300u990 4899187 t3 559 



J [1902 I [5574 |[TTT3~|p82 |pT 



probability 
1. 9e-37 



Description 

pir: [LN:B69875] [AC:B69875 ] [PN : conserved hypothetical protein ylbM] [GN:ylbM ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e334781 :g2340009] [LN:BS16823KB] [AC:Z98682] 
[PN : YlbM protein] [GN:ylbM] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis genomic DNA 23.9kB fragment.] [LE: 18936] [RE: 20183] [DI : complement] 
>gp: [GI:ell85096:g2633877] [LN: BSUB0008] [AC : Z99111 : AL009126] [GN : ylbM] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
8 of 21): from 1394791to 1603020.] [NT:similar to hypothetical proteins] [LE:178419] 
[RE : 179666] [DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



probability 



IM7S03000tt90_4970462_c2_904 



] [1903 | jbfc'/b ||750~[p49 ||837 ||1.5e-83 



Description 

sp : [LN : FABG_BACSU] [AC : P51831 : 031733] [GN: FABG] [OR:BACILLUS SUBTILIS] [EC : 1 . 1 . 1 . 100] 
[DE : ACYL CARRIER PROTEIN REDUCTASE)] [SP : P51831 : 031733 ] [DB : swissprot ] >pir : [LN: A69621] 
[AC:A69621 :PC4176 ] [PN : 3-oxoacyl- [acyl-carrier-protein] reductase, : 3-ketoacyl-acyl 
carrier protein reductase fabG:srb protein] [GN:fabG:srb ] [CL:ribitol 
dehydrogenase: short -chain alcohol dehydrogenase homology] [OR:Bacillus subtilis] 
[EC:1.1.1.100] [DB:pir2] >gp : [GI :ell85182:g2 633963] [LN : BSUB0009] [AC : Z99112 : AL009126] 
[PN:3-ketoacyl-acyl carrier protein reductase] [GN:fabG] [FN: fatty acid biosynthesis] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [EC : 1 . 1 . 1 . 100] [DE: Bacillus subtilis complete 
genome (section 9 of 21): from 1598421to 1807200.] [NT : alternate gene name: ylpF] 
[SP:P51831] [LE:65492] [RE:66232] [DI: direct] 



ORF Name 



[AI7503000990 FT 43 



Description 
NO-HIT 



NTID 



NT 

AAID 



AA 



LengthLength 



score probability 



[5676 |P^~| |46 | 



ORF Name 



AI7503000990_b096012_t3_48I 

Description 
NO-HIT 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



J |1905 | [5577 |7F 



450 



NT AA 

ORFJTame NTID AAID ^— thhe — fcfa score probability 



AI7503000990J>iiVl*>2_c2_826 | [1905 | [ 5578 | | 1023 1 | 340 | | |7 . le-93 



Description 

gp: [GI:dl039221:g4514629] [LN: ABO 16 8 94] [AC : AB016894] [PN:bd-type quinol oxidase 
subunit II] [GN : cbdB] [OR: Bacillus stearothermophilus] [SR: Bacillus stearothermophilus 
(strain:K1041) DNA] [DB : genpep t - be t 1 ] [DE:Bacillus stearothermophilus genes for bd-type 
quinol oxidasesubunit I and subunit II, complete cds . ] [NT : cytochrome bd-type quinol 
oxidase] [LE:1706] [RE:2734] [DIrdirect] 

NT AA 

ORF Name NTID AAID — _^ — , , score probability 
LengthLength c L - 



|AI7S03000990_SiB7963_t3_S73 | [1907 | [ 5579 |p5~~| |3i2 | pT5~|| 3 . 7e-22 — | 

Description 

pir: [LN:E69827] [AC:E69827 ] [PN:glycerophosphodiester phosphodiesterase homolog yhdW] 
[GN:yhdW ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell82963 :g2633297] [LN: BSUB0006] 
[AC:Z99109:AL009126] [GN:yhdW] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to glycerophosphodiester phosphodiesterase] [LE: 37668] [RE: 38399] 
[DI: complement] >gp: [GI :ell91883 :g2 226218] [LN:BSY14082] [AC:Y14082] [PN : hypothetical 

protein] [GN:yhdW] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 

chromosomal DNA, region 72 to 75 degrees: spoVRto sspB.] [NT: Similarity to glycerol 

diester phosphodiesterase] [LE: 24316] [RE: 25047] [DI : complement] 

NT AA 

ORF Name NTID AAID — , — , score probability 

LengthLength i - 

A17!>()3000990_Si62760_t3_S14 | [ 1908 | [ 5580 |p^5~| | 42 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



[AI7503000990 bli767I2 c3 1015 



LengthLength - 
[1909 | [ 5681 | [7UB— | [235 | p3~ || 1 . ie-64 



Description 

sp: [LN:DCOP_LACLC] [AC:P50924] [GN : PYRF] [OR : LACTOCOCCUS LACTIS] 

[SR: , SUBSPCREMORIS : STREPTOCOCCUS CREMORIS] [EC : 4 . 1 . 1 . 23] [DE : DECARBOXYLASE) ] 

[SP:P50924] [DB : swissprot ] >gp : [GI : e264705 : gl070361] [LN : LLCPYRDB] [AC:X74207] [PN:OMP 

decarboxylase] [GN:pyrF] [OR : Lactococcus lactis] [DB : genpept-bctl] [DE:L.lactis pyrD 

and pyrF genes.] [SP:P50924] [LE:5309] [RE: 6022] [DI: direct] 



NT AA 

ORF Name NTID AAID t — ^ t — , , score probabil ity 

LengthLength L - 



IAI7503000990 595312 ci 733 



Description 
NO-HIT 



[1910 | [5682 ||TT7 | |48 | 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000990_S974I38_iI_I58 

Description 

NO-HIT 



|5683 ip 04 I [5T 



451 



NT AA 

ORF Name NTID AAID Le ~ thLe ^ t h scor ^ probability 



|Al7603000990J>97H4!>3_c3_1029 | [1912 | [ 5584 | | 492 j | 163 | p^T~ || 1 . 6e-22 

Description 

sp : [LN : DEF_BACSU] [AC:P94462] [GN : DEF] [ORrBACILLUS SUBTILIS] [EC : 3 . 5 . 1 . 31] 
[DE : DEFORMYLASE ) ] [SP:P94462] [DB : swissprot] >pir : [LN: F69613] [AC:F69613 ] 
[PN: polypeptide deformylase def] [GN:def ] [CL : polypeptide def ormylase] [OR: Bacillus 

subtilis] [DB:pir2] >gp : [GI : e290287 :gl772499] [LN :BSPRIADFS] [AC:Y10304] 

[PN: polypeptide deformylase] [GN:def] [OR:Bacillus subtilis] [DB : genpept-bctl] 

[DE :B. subtilis priA, def, fmt, sun genes . ] [SP:P94462] [LE:1762] [RE:2244] [DI:direct] 

>gp: [GI :el!85163 :g2633944] [LN : BSUB0009] [AC : Z99112 : AL009126] [PN: polypeptide 

deformylase] [GN:def] [OR: Bacillus subtilis] [DB : genpept-bctl] [EC : 3 . 5 . 1 . 31] 
[DE:Bacillus subtilis complete genome (section 9 of 21): from 1598421to 1807200.] 
[NT:alternate gene name: yloK] [SP:P94462] [LE:47491] [RE:47973] [DI:direct] 

>gp: [GI:e323523:g2337801] [LN : BSY13937] [AC:Y13937] [PN:putative Def protein] [GN : yloK] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis genomic DNA 

from the spoVM region.] [SP:P94462] [LE:9263] [RE:9745] [DI:direct] 



NT AA 

ORF Name NTID AAID — , — , . score probability 
— — LengthLength — — — J L 



|&I7S03000990Ji023693_ci_730 | [1913 | [5685 |p^~| |95 | j |3 . 2e-25 

Description 

pir: [LN:D69873] [AC:D69873 ] [PN: hypothetical protein ylaN] [GN:ylaN ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell85074:g2 633855] [LN: BSUB0008] [AC : Z99111 : AL009126] 
[GN:ylaN] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 8 of 21): from 1394791to 1603020.] [LE:157027] [RE:157308] 
[DI: direct] >gp: [GI :ell91897 :g2224773] [LN: BSZ97025] [AC:Z97025] [GN:ylaN] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis nprE, 

yla[A,B,C,D,E,F,G,H, I, J,K,L,M,N,0] and pycAgenes . ] [LE:11286] [RE:11567] [DI:direct] 



NT AA 

ORF Name NTID AAID — _ „ — _ score probability 
LengthLength ^ 

|AI7603000990_6072125_cl_766 | [1914 | [5686 |p2~ 1 1 223 | [ggT~| |l . le-flb | 

Description 

gp: [GI:g4009493] [LN: AF068904] [AC : AF068904] [PN: YlmE] [GN : ylmE] [OR : Staphylococcus 
aureus] [DB :genpept-bct2] [DE : Staphylococcus aureus cell division protein FtsZ (ftsZ) 
gene, partial cds; YlmD (ylmD) , YlmE (ylmE), YlmF (ylmF) , YlmG (ylmG) , and YlmH (ylmH) 
genes, complete cds; and cell division proteinDivIVA (divIVA) gene, partial cds.] 
[NT: similar to Bacillus subtilis YlmE] [LE:1246] [RE: 1920] [DI:direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



AI7503000990_61i5700_t3_647 [ [1915 | [5587 |pUT]| 467 | p5g"" | |7 ■ Be-64 J 

Description 

pir: [LN:H69858] [AC:H69858 ] [PN: cation ABC transporter (ATP-binding protei) homolog 
ykoD] [GN : ykoD ] [CL : ATP-binding cassette homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell81522:g2632042] [LN : BSAJ2 571] [AC : AJ002571] [PN: YkoD] [GN: ykoD] [OR:Bacillus 
subtilis] [DB: genpept-bctl] [DE:Bacillus subtilis 168 56 kb DNA fragment between xlyA 
and ykoR.] [NT : homologous to cobalt transport ATP binding protein] [LE:41106] 
[RE:42578] [DI : complement] >gp : [GI : ell83342 :g2633676] [LN : BSUB0007] 

[AC:Z99110:AL009126] [GN: ykoD] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 7 of 21): from 1194391to 1411140.] 
[NT:similar to cation ABC transporter (ATP-binding] [LE:194055] [RE:195527] 
[DI : complement] 



452 



ORF Name 



NT AA 
NTID AAID — _ — ^ score 
LengthLength 



probability 



IAI7503000990 61J1693 ci 752 



] [1915 | [5588 ||414"-||i37 ||b74 |[i.le-55 



] 



Description 

gp: [GI:dl023421 :g2463560] [LN : AB007500] [AC : ABO 075 00 : D2 8879] [OR : Staphylococcus aureus] 
[SR: Staphylococcus aureus (strain:NCTC8325) DNA] [DB :genpept-bctl] [DE : Staphylococcus 

aureus genes for penicillin-binding protein 1, MraY,MurD, partial and complete cds . ] 
[NT:unnamed protein product] [LE:212] [RE:613] [DI:direct] >gp : [GI : g2149892] 
[LN:SAU94706] [AC:U94706] [PN:cell division protein] [GN : yllD] [OR : Staphylococcus 

aureus] [DB : genpept - be t 2 ] [DE : Staphylococcus aureus strain ATCC 8325-4 cell wall/cell 

divisiongene cluster, yllB, yllC, yllD, pbpA, mraY, murD, divlB, ftsA andftsZ genes, 

complete cds.] [LE:1936] [RE: 2337] [DI : direct] 



ORF Name 



NT 



AA 



NTID AAID Le ~ thLe ~ th score probability 



03000990 6149077 c3 1033 



] [1917 | [5589 [pi - 1|296 ||708 ||7.0e-70 



Description 

pir : [LN:A69879] [AC:A69879 ] [PN : conserved hypothetical protein yloQ] [GN:yloQ ] 
[CL:conserved hypothetical protein HI1714] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85169:g2633950] [LN : BSUB0009] [AC : Z99112 : AL009126] [GNryloQ] [FNrunknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
9 of 21): from 1598421to 1807200.] [NT:similar to hypothetical proteins] [LE:54082] 
[RE:54978] [DI:direct] >gp : [GI : e323525 :g2337807] [LN : BSY13 93 7] [AC:Y13937] [PN:YloQ 
protein] [GN :yloQ] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis genomic DNA from the spoVM region.] [LE: 15854] [RE: 16750] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID , _ v 

LengthLength 



score 



probability 



U7503000990 632762 r3 571 



Description 
NO-HIT 



^ [1918 | [5690 |p?2 | |7T" 



ORF Name 



NTID AAID 



^£7503000990 €42 c3 1005 



][ 



NT AA 
LengthLength 



score probability 
14356 110. 0 



Description 

sp: [LN:SYI__STAAU] [AC:P41972] [GN: ILES] [OR : STAPHYLOCOCCUS AUREUS] [EC:6.1.1.5] 
[DE : (ILERS) ] [SP:P41972] [DB : swissprot] >pir : [LN : S40178] [AC:S40178 ] 
[ PN : i sol euc ine - - tRNA ligase, : isoleucyl -tRNA synthetase] [CL : isoleucine- - tRNA ligase] 
[OR : Staphylococcus aureus] [EC:6.1.1.5] [DB:pir2] >gp : [GI :g437916] [LN: SAILES] 
[AC:X74219] [PN : isoleucyl - tRNA synthetase] [GN:ileS] [OR : Staphylococcus aureus] 
[DB:genpept-bctl] [DE:S. aureus gene for isoleucyl -tRNA synthetase.] [SP:P41972] [LE:91] 
[RE:2844] [DI:direct] 



ORF Name 



NTID AAID _ — . , _ — . 



|A^7503000990 5720452 c3 936 



NT AA 
LengthLength 
[1920 | [5692 ||1355 | 



score probability 
11266 ||5.2e-129 



Description 

gp: [GI:dl039220:g4514628] [LN : AB016894] [AC : AB016894] [PN:bd-type quinol oxidase 
subunit I] [GN: cbdA] [OR: Bacillus stearothermophilus] [SR: Bacillus stearothermophilus 
(strain:K1041) DNA] [DB : genpept-bctl] [DE:Bacillus stearothermophilus genes for bd-type 
quinol oxidasesubunit I and subunit II, complete cds.] [NT : cytochrome bd-type quinol 
oxidase] [LE:363] [RE: 1709] [DI: direct] 



453 



NT AA 

ORF Name NTID AAID Le ^ t fr Length score probability 



|AI7503000990J>7mbb_t3_b30 | [1921 | [5593 | pT7~| [ 48 |p [[ 0.045 | 



Description 

gp: [GI:g4049913] [LN : AF063866] [AC : AF063866 ] [PN:ORF MSV141 hypothetical protein] 
[GN:MSV141] [OR : Melanoplus sanguinipes entomopoxvirus] [DB :genpept-vrl] [DE : Melanoplus 
sanguinipes entomopoxvirus, complete genome.] [LE: 121430] [RE: 121630] [DI : complement] 

NT AA 

ORF Name NTID AAID — — , score probability 
Length Length ^ JL 



IAI7503000990 6834687 c2 831 



1922 | - |5CTr-||309 | [102 | p5~ ||5 . 3e-17 



Description 

pir : [LN:C69864] [AC:C69864 ] [PN : hypothetical protein yktA] [GN:yktA ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell85054 :g2633835] [LN : BSUB0008] [AC : Z99111 : AL009126] 
[GN:yktA] [FNrunknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 8 of 21): from 1394791to 1603020.] [LE:140551] [RE:140817] 
[DI:direct] >gp: [GI :g3282148] [LN : AF012285] [AC : AF012285 : AF012284 :U51911] [PN:unknown] 
[GN:yktA] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 2 ] [DE:Bacillus subtilis 
mobA-nprE gene region.] [LE: 41039] [RE: 41305] [DI: direct] 

— ^— — — — ^ , -m npn— !— ^ — — — — "-j— — ^— — — nam , 

NT AA 

ORF Name NTID AAID — _ T — , . score probabi lity 
Length Length c JL 

p7503000990_722b000_c3_1038 | [1923 [ [5695 [[TuTT] [341 [[898 ||5T7e-90 | 

Description 

pir: [LN:H69679] [AC:H69679 ] [PN:involved in fatty acid/phospholipid synthesis plsX] 
[GN:plsX ] [CL: phospholipid synthesis protein] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell85180:g2633961] [LN : BSUB0009] [AC : Z99112 : AL009126] [GN:plsX] [FN:involved in 
fatty acid/phospholipid synthesis] [OR: Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 9 of 21) : from 1598421to 1807200.] 
[NT: alternate gene name: ylpD] [LE: 63526] [RE: 64527] [DI: direct] 
>gp: [GI:e323529:g2337818] [LN : BSY13 93 7] [AC:Y13937] [PN:putative PlsX protein] 
[GN:ylpD] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
genomic DNA from the spoVM region.] [LE: 25298] [RE: 26299] [DI: direct] 



NT AA 

ORF Name NTID AAID — — _ score probability 
LengthLength 



AI7503000990_7320465_ci_719 [ [1924 [ [5696 [276 ||525 ||1.7e-50 



Description 

pir: [LN:G70179] [AC:G70179 ] [PN : spermidine /putrescine ABC transporter, permease 
protein (potC) homolog] [CL : spermidine /putrescine transport system permease protein 
potl] [OR:Borrelia burgdorferi] [SR:, Lyme disease spirochete] [DB:pir2] 
>gp: [GI:g2688564] [LN: AE001165] [AC : AE001165 : AE000783] [PN : spermidine/putrescine ABC 
transporter, permease] [GN:BB0640] [OR:Borrelia burgdorferi] [SR:Lyme disease 
spirochete] [DB :genpept-bct2] [DE:Borrelia burgdorferi (section 51 of 70) of the 
complete genome.] [NT: similar to GB:M64519 SP:P23859 PID: 147328 GB:U00096] [LE: 10307] 
[RE : 11098] [DI : complement] 

NT AA 

ORF Name NTID AAID — _ „ — , , score probability 

LengtnLengtn 



AI7503000990_86717g_c3_924 | [1925 [ [5697 [piT] [ 503 | [ 1090 | [2 . 3e-il0 



Description 

pir: [LN:C70468] [AC:C70468 ] [PN : phosphoribosylaminoimidazolecarboxamide 
formyl transferase] [GN: purH ] [CL:purH bifunctional enzyme] [OR:Aquifex aeolicus] 
[DB:pir2] >gp: [GI :g2984204] [LN: AE000765] [AC : AE000765 : AE000657] 
[PN:phosphoribosylaminoimidazolecarboxamide] [GN :purH] [OR: Aquifex aeolicus] 
[DB:genpept-bct2] [DE: Aquifex aeolicus section 97 of 109 of the complete genome.] 
[LE : 7599] [RE : 9119] [DI : complement] 



454 



ORF Name 



NTID 



NT 



AA 



AAID _ — . , . — s 



LenqthLength 



score 



IAI7503000990 892141 t2 303 



] |1926 | [5598 |p5~] |131 | p^T" 



probability 
I.2e-24 



Description 

gp: [GI:el487496:g5042304] [LN : MMU242955] [AC : A J2 42955] [PN : hypothetical protein 
(P4(21)n)] [GN:P4(21)n] [OR:Mus musculus] [SR:house mouse] [DB : genpept -rod] [DE:Mus 

musculus partial mRNA for hypothetical protein (P4(21)n gene).] [NT:similar with 

amyh_yeast glucoamylase S1/S2] [LE:<1] [RE:1203] [DI:direct] >gp : [GI :dl042681 :g5103287] 
[LN:AB028868] [AC : AB028868] [GN:P4(21)n] [OR:Mus musculus] [SR:Mus musculus mRNA] 
[DBrgenpept] [DE:Mus musculus P4(21)n mRNA, partial cds . ] [NT : The protein is similar 

with AMYH_YEAST GLUCOAMYLASE] [LE:<1] [RE: 1203] [DI: direct] 



ORF Name 



IAI7503000990 892312 c3 980 



Description 
NO-HIT 



NTID AAID 



NT AA 
LengthLength 
|5g99 |pr~] [ET 



score probability 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000990 958785 F2 361 



1928 | [5700 ||939 | |312 | |rT7¥]|7 . 4e-130 



Description 

gp: [GI:g3320606] [LN : AF072726] [AC : AF072726] [PN:putative heme A synthase] [GN: ctaA] 
[OR staphylococcus aureus] [DB : genpept -bct2 ] [DE : Staphylococcus aureus putative heme A 
synthase (ctaA) gene , complete cds.] [NT: CtaA] [LE:13 9] [RE: 1050] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID — 

LengthLength 



IM7503000990 968811 cl 753 



|5701 | |?T5~ 



score probability 
b99 p.Be-68 



Description 

sp: [LN:DHSC_BACSU] [AC:P08064] [GN: SDHC] [OR:BACILLUS SUBTILIS] [DE : SUCCINATE 
DEHYDROGENASE CYTOCHROME B-558 SUBUNIT] [SP:P08064] [DB : swissprot] >pir : [LN : DEBSSC] 
[AC:A29843 :I39972 :E69704 ] [PN : succinate dehydrogenase, cytochrome b558 : f umarate 
reductase C protein] [GN: sdhC : sdhA ] [CL : succinate dehydrogenase cytochrome b558] 
[OR:Bacillus subtilis] [EC : 1 . 3 . 99 . 1] [DB:pirl] [MP:70 ] >gp : [GI : gl43525] [LN : BACSDHAB] 
[AC:M13470 :M15107] [OR:Bacillus subtilis] [SR : B . subtilis DNA, clone pKIM4] 
[DB :genpept-bctl] [DE :B . subtilis succinate dehydrogenase complex encoding 
cytochromeb-558 subunit, complete cds, and flavoprotein subunit, 5' end.] [NT : succinate 
dehydrogenase cytochrome b-558 subunit] [LE:265] [RE:873] [DI:direct] 
>gp: [GI:ell84094:g2635310] [LN : BSUB0015] [AC : Z99118 : AL009126] [PN : succinate 
dehydrogenase (cytochrome b558] [GN:sdhC] [OR: Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[SP:P08064] [LE:112102] [RE:112710] [DI : complement ] >gp : [GI :ell65347 :gl7 70051] 
[LN:BSZ75208] [AC:Z75208] [PN : cytochrome b558 subunit of succinate] [GN:sdhC] 
[OR:Bacillus subtilis] [DB : genpept -bet 1] [EC : 1 . 3 . 99 . 1] [DE : B . subtilis genomic sequence 
89009bp.] [SP:P08064] [LE:57197] [RE:57805] [DI:direct] 



ORF Name 



LL7b03000990 969052 r2 434 



Description 
NO-HIT 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



[5702 ||T?5 | 



ORF Name 



AI7503000990_988i927_tiJ7t> 

Description 
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ORF Name 
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Description 

Sp : [LN : FTSZ_STAAU] [AC:P45498] [GN : FTSZ] [OR : STAPHYLOCOCCUS AUREUS] [DE: CELL DIVISION 
PROTEIN FTSZ] [SP:P45498] [DB : swissprot] >pir : [LN : S58814] [AC:S58814 ] [PN:cell 
division protein ftsZ] [CL:cell division protein ftsZ] [OR : Staphylococcus aureus] 
[DBipirl] >gp: [GI:g458428] [LN: SAU06462] [AC:U06462] [PN:FtsZ] [GNrftsZ] 
[OR: Staphylococcus aureus] [DB :genpept-bctl] [DE : Staphylococcus aureus SA4 FtsZ (ftsZ) 
gene, complete cds . ] [LE:27] [RE:1199] [DI:direct] >gp : [GI :g2149898] [LN : SAU94706] 
[AC:U94706] [PN:cell division protein] [GN:ftsZ] [OR : Staphylococcus aureus] 
[DB :genpept-bct2] [DE : Staphylococcus aureus strain ATCC 8325-4 cell wall/cell 
divisiongene cluster, yllB, yllC, yllD, pbpA, mraY, murD, divlB, ftsA andftsZ genes, 
complete cds.] [LE: 10052] [RE: 11224] [DI:direct] 
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Description 
NO-HIT 
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LengthLength 



— , score probability 
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Description 

sp: [LN:PUR3_BACSU] [AC:P12040] [GN : PURN] [OR:BACILLUS SUBTILIS] [EC:2.1.2.2] 
[DE: TRANS FORMYLASE) (5 » -PHOSPHOR I BOS YLGLYCINAM IDE TRANS FORMYLASE) ] [SP:P12040] 
[DB: swissprot] >pir: [LN:XYBSGF] [AC : 12 9326 : B6 9685 ] [PN: phosphor ibosylglycinamide 
formyltransf erase, ] [GN :purN ] [CL : phosphor ibosylglycinamide formyltransf erase : 
phosphoribosylglycinamide formyltransf erase homology] [OR: Bacillus subtilis] 
[EC:2.1.2.2] [DB:pirl] [MP:18min] >gp : [GI :gl43372] [LN : BACPURF] [AC : J02732 : K00047] 
[OR:Bacillus subtilis] [SR :B . subtilis (strain DEI (prototroph DER. or W168) ) DNA, clone 
pPZ] [DB :genpept-bctl] [DE : B . subtilis pur operon encoding purine biosynthesis enzymes, 
12genes.] [NT : phosphoribosyl glycinamide formyltransf erase] [LE:9833] [RE: 10420] 
[DI:direct] >gp: [GI :ell82631:g2 632965] [LN : BSUB0004] [AC : Z99107 : AL009126] 
[PN: phosphoribosylglycinamide formyltransf erase] [GN:purN] [FN: purine biosynthesis] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [EC:2.1.2.2] [DE:Bacillus subtilis complete 
genome (section 4 of 21): from 600701 to813890.] [SP:P12040] [LE:106815] [RE:107402] 
[DI : direct] 
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Description 

Sp: [LN: YWBD_BACSU] [AC:P39587] [GN : YWBD : IPA-19D] [OR.-BACILLUS SUBTILIS] 
[DE: HYPOTHETICAL 44.4 KD PROTEIN IN EPR-GALK INTERGENIC REGION] [SP:P39587] 
[DB: swissprot] >pir : [LN : S39674] [AC : S39674 : B70051 ] [PN:ywbD protein hypothetical 
protein ipa-19d] [GN : ywbD ] [CL:probable methyl transferase b0967] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :g413943] [LN : BSGENR] [AC:X73124] [GN:ipa-19d] [OR:Bacillus 
subtilis] [DB :genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] [SP:P39587] 
[LE:20551] [RE:21741] [DI:direct] >gp : [GI :ell86335:g2 636371] [LN :BSUB0020] 
[AC:Z99123 :AL009126] [GN:ywbD] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 20 of 21): from 3798401to 4010550.] 
[NT:alternate gene name: ipa-19d; similar to] [SP:P39587] [LE:136218] [RE:137408] 
[DI : complement] 
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Description 

gp: [GI:g2196686] [LN : EFU25090] [AC:U25090] [PN : pyrimidine biosynthesis protein Ab] 
[GNrpyrAb] [OR : Enterococcus faecalis] [DB :genpept-bctl] [DE : Enterococcus faecalis 

plasmid pKV48 pyrimidine biosynthesis proteinAb (pyrAb) gene, partial cds . ] [LE:<1] 
[RE:>253] [Dlrdirect] 



ORF Name 



NTID AAID 
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score 
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Description 

pir: [LN:T03492] [AC:T03492 ] [PN : hypothetical protein] [OR : Rhodobacter capsulatus] 
[DB:pir2] [MP:1 ] >gp : [GI :g3128293] [LN : AF010496] [AC : AF010496] [PN : hypothetical 
protein] [OR : Rhodobacter capsulatus] [DB : genpept -bet 2 ] [DE : Rhodobacter capsulatus 
strain SB1003, partial genome.] [LE: 54291] [RE: 55613] [D I : complement] 
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Description 

gp: [GI:g4160468] [LN: AF109909] [AC : AF109909] [PN:putative protein] [GN:ykoY] 
[OR: Bacillus megaterium] [DB : genpept-bct2] [DE: Bacillus megaterium polyhydroxyalkanoate 

gene cluster, completesequence . ] [NT: similar to toxic anion resistance protein] 
[LE:277] [RE:1092] [DI:direct] 
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Description 
NO-HIT 
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Description 

pir: [LN:QQSA8T] [AC:A04493 ] [PN : hypothetical protein B-189] [OR : Staphylococcus aureus] 
[DB:pirl] 
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Description 

pir: [LN:H70003] [AC:H70003 ] [PN:probable prolyl aminopeptidase , ytxM: cytosol 

aminopeptidase V:Pro-X aminopeptidase : proline aminopeptidase : proline iminopeptidase] 
[GN:ytxM ] [OR:Bacillus subtilis] [EC : 3 . 4 . 11 . 5] [DB:pir2] >gp : [GI : ell85954 : g2635565] 
[LN:BSUB0016] [AC : Z99119 : AL009126] [GN:ytxM] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 16 of 21) : from 

2997771to 3213410.] [NT : alternate gene name: ytfB; similar to prolyl] [LE: 151061] 
[RE:151885] [DI : complement] >gp : [GI :g2293147] [LN: AF008220] [AC : AF008220] [PN : YtxM] 
[GN:ytxM] [OR:Bacillus subtilis] [DB : genpept -be t2] [DE:Bacillus subtilis rrnB-dnaB 

genomic region.] [NT : similarity with 2 -hydroxy- 6- oxo- 2 , 4-heptadienoate] [LE:28542] 
[RE : 29366] [DI : direct] 
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Description 

pir: [LN:F69841] [AC:F69841 ] [PN : conserved hypothetical protein yitU] [GN:yitU ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : e261990 :gl620926] [LN : BS168NPRB] [AC:Z79580] 
[GN:putative orf] [FNiunknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis 

nprB gene.] [LE:4492] [RE:5304] [DI : complement] >gp : [GI :ell83116 :g2633450] 
[LN:BSUB0006] [AC : Z99109 : AL009126] [GNiyitU] [FN:unknown] [OR:Bacillus subtilis] 
[DB : genpept -bctl ] [DE:Bacillus subtilis complete genome (section 6 of 21) : from 999501 

tol209940.] [NT:similar to hypothetical proteins] [LE:190463] [RE:191275] 
[DI : complement] >gp: [GI : ell73548 : g2145415] [LN: BSY09476] [AC:Y09476] [PNrYitU] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis 54kb genomic DNA fragment.] 
[NTrputative ORF] [LE : 47689] [RE: 48501] [DI : complement] 

NT AA 

ORF Name NTID AAID — — score probability 
LengthLength *- 



AI7S03000991jL27S0250_cl_400 | [1944 | [5716 |[IT^r|| 373 | pTT| |l . 6e-125 



Description 

pir : [LN:B69669] [AC : B69669 : S15233 : D38447 ] [PN: oligopeptide transport ATP-binding 
protein oppD : sporulation initiation protein spoOKD] [GN: spoOKD : oppD ] [CL: inner 
membrane protein malK : ATP-binding cassette homology] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:ell83166:g2633500] [LN: BSUB0007] [AC : Z99110 : AL009126] [PN : oligopeptide ABC 
transporter (ATP-binding] [GN:oppD] [FN:required for initiation of sporulation,] 
[OR: Bacillus subtilis] [DB : genpept -bctl] [DE: Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NT : alternate gene name: spoOKD] [LE:28537] 
[RE:29613] [DI :direct] 



NT AA 

ORF Name NTID AAID — — , score probability 
— — LengthLength — *~ 
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Description 

pir: [LN.-A60998] [AC:A60998 ] [PN : replication protein REP] [OR : Staphylococcus aureus] 
[DB:pir2] >gp: [GI :g295834] [LN: SAPOX2 000] [AC:X55798] [OR : Staphylococcus aureus] 
[DB: genpept -bctl] [DE : S . aureus plasmid pOX2000.] [NT : ORF 154] [LE:587] [RE:1051] 
[DI : direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 
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Description 
NO-HIT 
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Description 

pir: [LN:S75993] [AC:S75993 ] [PN : hypothetical protein] [OR : Synechocystis sp.] [SR:PCC 
6803, , PCC 6803] [SR:PCC6803, ] [DB:pir2] >gp : [GI :dl011491 :gl001353] [LN : SYCSLLLH] 
[AC:D64006 :AB001339] [PN : hypothetical protein] [GN:clpP] [OR : Synechocystis sp . ] 
[SR: Synechocystis sp . (strain : PCC6803 ) DNA] [DB :genpept-bctl] [DE : Synechocystis sp. 
PCC6803 complete genome, 25/27, 3138604-3270709.] [NT : ORF_ID : S110498] [LE:80076] 
[RE: 80528] [DI : complement] 
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Description 

pir: [LN:A70417] [AC:A70417 ] [PN : hypothetical protein aq_1349] [GN:aq_1349 ] 
[OR:Aquifex aeolicus] [DB:pir2] >gp : [GI : g2983776] [LN : AE000736] [AC : AE000736 : AE000657] 
[PN:putative protein] [GN:aq_1349] [OR:Aquifex aeolicus] [DB :genpept-bct2] [DE:Aquifex 

aeolicus section 68 of 109 of the complete genome.] [LE:1950] [RE:2756] [DI:direct] 
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Description 

gp: [GI:gl022726] [LN: SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept -bet 1 ] 

[DE: Staphylococcus haemolyticus IS12 72 ORF1 and ORF2 genes, completecds . ] [NT :ORFl] 

[LE:1101] [RE: 1922] [DI : complement ] 
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Description 

gp: [GI:gl022726] [LN : SHU3563 5] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR: Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 

[DE : Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds.] [NT : ORF1] 

[LE:1101] [RE: 1922] [DI : complement ] 
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Description 

Sp : [LN : G6PA_BACST] [AC:P13375] [GN: PGIA] [OR:BACILLUS STEAROTHERMOPHILUS] [EC:5.3.1.9] 
[DE : ISOMERASE A) ] [SP:P13375] [DB : swissprot ] >pir : [LN : NUBSSA] [AC : S15936 : S06196 ] 
[PN : glucose - 6 -phosphate isomerase , A : phosphoglucose isomerase : phosphohexose isomerase] 
[CL : glucose- 6 -phosphate isomerase] [OR : Bacillus stearothermophilus] [EC : 5 . 3 . 1 . 9] 
[DB:pirl] >gp: [GI:g40046] [LN : BSPGIA] [AC:X16639] [OR:Bacillus stearothermophilus] 
[DB : genpept -bctl] [DE:Bacillus stearothermophilus pgiA gene for 

phosphoglucoisomeraseisoenzyme A (EC 5.3.1.9).] [NT : phosphoglucose isomerase A (AA 
1-449)] [SP:P13375] [LE:95] [RE:1444] [DI:direct] 



ORF Name 



NTID 



AAID 



17503000991 14648512 c3 585 



"5T2T" 



NT AA 
LengthLength 



score 



probability 



Description 
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Description 

gp: [GI:g3158514] [LN : CELT04D1] [AC : AF067617] [GN:T04D1.2] [OR : Caenorhabditis elegans] 
[DB : genpept - invl ] [DE : Caenorhabditis elegans cosmid T04D1.] 

[LE : 10425: 10567: 1142 0 : 12480] [RE : 10517 : 10646 : 11689 : 12 789] [DI : direct Join] 
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Description 
NO-HIT 
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Description 

pir: [LN:A69863] [AC:A69863 ] [PN: conserved hypothetical protein ykrP] [GN:ykrP ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84942 : g2633723] [LN : BSUB0008] 
[AC:Z99111:AL009126] [GN:ykrP] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:Bacillus subtilis complete genome (section 8 of 21): from 1394791to 1603020.] 
[NT: similar to hypothetical proteins] [LE: 22614] [RE: 23636] [DI : complement] 
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Description 

pir: [LN:H69843] [AC:H69843 ] [PN : hypothetical protein yjbH] [GN:yjbH ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell83175:g2 633509] [LN : BSUB0007] [AC : Z99110 : AL009126] 
[GN:yjbH] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [LE:38695] [RE:39522] 
[DI : complement] 
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Description 

Sp : [LN : BIN3_STAAU] [AC:P20384] [GN:BIN3] [OR : STAPHYLOCOCCUS AUREUS] [DE : POTENTIAL 
DNA- INVERTASE BIN3 (TRANSPOSON TN552) ] [SP:P20384] [DB : swissprot] >gp : [GI : g3 98182 ] 
[LN: SABINR3] [AC:X16298] [OR : Staphylococcus aureus] [DB : genpept-bctl] 
[DE: Staphylococcus aureus plasmid pI9789 DNA with binR and bin3 genes , derived from 
transposon TN552.] [NT:bin3 product] [SP:P20384] [LE:1049] [RE:1657] [DI: direct] 
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Description 

pir : [LN:S42239] [AC:S42239 ] [PN : hypothetical protein 3] [OR : Staphylococcus aureus] 
[DB:pir2] >gp : [GI :g501834] [LN : PNS1CG] [AC:M16217] [ORrPlasmid pNSl] [SR:Plasmid pNSl 
from Staphylococcus aureus, plasmid pTP5 DNA] [DB :genpept-bctl] [DErPlasmid pNSl (from 
Staphylococcus aureus) encodingtetracycline-resistance (tet) , complete genome.] [NT : ORF 
C; putative] [LE:1870] [RE: 2409] [DIrdirect] 
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Description 
NO-HIT 



ORF Name 


NT AA 
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Description 

sp: [LN:G6PD_LEUME] [AC:P11411] [GN:ZWF] [OR : LEUCONOSTOC MESENTEROIDES] [EC : 1 . 1 . 1 . 49] 
[DE: GLUCOSE- 6 -PHOSPHATE 1 -DEHYDROGENASE, (G6PD) ] [SP:P11411] [DB : swissprot] 
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Description 

pir: [LN:C69837] [AC:C69837 ] [PN : 5-oxo- 1 , 2 , 5- tricarboxilic- 3 -penten acid de homolog 
yisK] [GN: yisK ] [CL : 2 -hydroxyhepta-2 , 4 -diene-1 , 7-dioate isomerase] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell83077 :g2633411] [LN: BSUB0006] [AC : Z99109 : AL009126] 
[GN:yisK] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 6 of 21): from 999501 tol209940.] [NT:similar to 
5-oxo-l, 2, 5-tricarboxilic-3-penten acid] [LE:152216] [RE:153121] [DI:direct] 
>gp: [GI:ell73509:g2145376] [LN: BSY09476] [AC:Y09476] [PN: YisK] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE :B. subtilis 54kb genomic DNA fragment.] [NT:putative - Homology 
with oxo-tricarboxilic-pentene] [LE:9442] [RE: 10347] [DI : direct] 
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Description 

gp: [GI:dl045996:g5360820] [LN:D86934] [AC:D86934] [PN: transposase] [OR : Staphylococcus 
aureus] [SR : Staphylococcus aureus (strain :N315) DNA, clone_lib : library of N31] 
[DB:genpept] [DE : Staphylococcus aureus genes, mec region, partial and complete cds.] 
[NT : ORF N026; putative] [LE:19527] [RE:19751] [DI:direct] 
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Description 

pir: [LN:F69844] [AC:F69844 ] [PN : conserved hypothetical protein yjbN] [GN:yjbN ] 
[CLrconserved hypothetical protein HI0072] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI :ell83181 :g2633515] [LN : BSUB0007] [AC : Z99110 : AL009126 ] [GN:yjbN] [FN:unknown] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NTisimilar to hypothetical proteins] [LE:42741] 
[RE: 43541] [DI : direct] 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000991 20517062 c2 47S 



] [1966 | [5738 [pTuT] [333 ||961 ||i.ie-96 



Description 

pir: [LN:F69842] [AC:F69842 ] [PN : 3 -oxoacyl - acyl-carrier protein synthase homolog yjaX] 
[GN:yjaX ] [CL : 3-oxoacyl- [acyl-carrier-protein] synthase III] [OR:Bacillus subtilis] 
[DB:pir2] >gp: [GI : ell83136 :g2633470] [LN: BSUB0006] [AC : Z99109 : AL009126] [GN:yjaX] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 6 of 21): from 999501 tol209940.] [NT:similar to 3-oxoacyl- 

acyl-carrier protein] [LE:208189] [RE:209127] [DI: direct] >gp : [GI : ell83153 :g2633487] 
[LN:BSUB0007] [AC : Z99110 : AL009126] [GN:yjaX] [FN:unknown] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 7 of 21): from 

1194391to 1411140.] [NT:similar to 3-oxoacyl- acyl-carrier protein] [LE:13299] 
[RE : 14237] [DI : direct] 



ORF Name 



NT 



AA 



NT ID AAID — _ T — , . score proba bility 
LengthLength c JL 



[A17503000991 20585302 t2 149 



Description 
NO-HIT 



] [1967 | [5739 |[T25~[|41 | 



ORF Name 



NT ID 



AAID 



E 



17503000991 20739037 cl 452 



NT AA 
LengthLength 
TIE 



score 



probability 
5.7e-68 



Description 

gp: [GI:g2267243] [LN : SEU71377] [AC:U71377] [PN:putative transcriptional regulator AtlR] 
[OR staphylococcus epidermidis] [DB : genpept-bctl] [DE : Staphylococcus epidermidis 

autolysin AtlE and putativetranscriptional regulator AtlR genes, complete cds . ] 
[LE : 6867] [RE : 7286] [DI : complement] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



probability 



IAI7503000991 2125903 El 313 



19S9 | [5741 |pT5-||4i2 | |rg7~||6 .2e-i3 



Description 

pir: [LN:D71112] [AC:D71112 ] [PN : hypothetical protein PH0667] [GN:PH0667 ] 
[OR:Pyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030701 : g3257075] [LN : AP000003] 
[AC:AP000003 :AB009484 : AB009485 : AB009486 : AB009487 : AB009488 :AB00948 9] [PN:413aa long 

hypothetical protein] [GN:PH0667] [OR : Pyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB : genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 

544001-777000 nt . position (3/7) . ] [LE:51915] [RE:53156] [DI : complement] 



462 



NT AA 

QRF Name ^ ^ he - thhe - th s£oie probability 



A17503000991_21b71937_i3_3gQ | [1970 | [ 5742 | | 213 1 17 0 |pi 1 |0.015 



Description 

pir: [LN:F71120] [AC:F71120 ] [PN : hypothetical protein PH0733] [GN:PH0733 ] 
[OR: Pyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030767 :g3257141] [LN: AP000003] 
[AC:AP000003 :AB009484 :AB009485 :AB009486 : AB009487 : AB009488 :AB009489] [PN:150aa long 

hypothetical protein] [GN:PH0733] [ORrPyrococcus horikoshii] [SR : Pyrococcus horikoshii 
(strain:OT3) DNA] [DB:genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic DNA, 

544001-777000 nt. position (3/7 ). ] [LE:107943] [RE:108395] [DI : complement] 



NT AA 

QRF Name NTID AAID — _ — _ score probability 
LengthLength c 



&:g7S03000591_2iS54202J:2J.67 | [1971 | [ 5743 ||XT75-j| 39i | |Ig7T]| 3 . Oe-172 



Description 

gp: [GI:el301685:g3256224] [LN: SAY14370] [AC:Y14370] [GNiypfP] [OR : Staphylococcus 

aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus RF3 , murE, ypfP genes.] [LE:5160] 
[RE:6335] [DIrdirect] 



NT AA 

QRF Name NTID AAID — , — , , score probability 
LengthLength c 



LA17b0300099i 21753125 c2 481 



1972 | [ 5744 ||TT5¥] p9 7 | pT~| |2 . Oe-79 



Description 

pir : [LN:S16649] [AC : S16649 : S182 71 : E6 9618 : 140001 ] [PN : dipeptide ABC transporter 
(permease) :dciAC protein] [GN: dppC : dciAC ] [CL : oligopeptide permease protein oppB] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell81495 :g2632015] [LN : BSAJ2 571] [AC : AJ002571] 
[PN:DppC] [GN:dppC] [FN: ABC -type dipeptide transport system] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE: Bacillus subtilis 168 56 kb DNA fragment between xlyA and ykoR.] 
[LE:14301] [RE:15263] [DI: direct] >gp : [GI :ell833 14 :g2 633648] [LN : BSUB0007] 
[AC:Z99110 :AL009126] [PN : dipeptide ABC transporter (permease)] [GN:dppC] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 7 of 21) : 
from 1194391to 1411140.] [NT : alternate gene name: dciAC] [LE:167250] [RE:168212] 
[DI:direct] 



NT AA 

QRF Name NTID AAID — — , , score probability 
LengthLength £ - 

AI7B03000991_22272S83_t3_291 | [1973 | [ 5745 |p3~| | 40 | |86 | j0 . 0040 



Description 

gp: [GI:g4098413] [LN: LOU77495] [AC:U77495] [PN:putative integrase] [GN:int] 
[OR:Leuconostoc oenos bacteriophage 10MC] [DB : genpept-phg] [DE : Leuconostoc oenos 
bacteriophage 10MC putative integrase (int) gene , complete cds . ] [NT : site-specif ic 
recombinase] [LE:119] [RE: 1165] [DI: direct] 

NT AA 

QRF Name NTID AAID — i _ 1 _ T — , , score probabilit y 
LengthLength *■ 



AI7503000991__22455512_t3_28g | [1974 | | 574€ [pT7~| | 48 | 



Description 
NO-HIT 



463 



NT AA 

QRF Name *™ AAID Le - thLe - th ^ore probability 

|A17b03000991_22462787_cl_392 | [1975 | [5 7 4 7 | pB~| | 1 7 4 | |S35— 1| 1 . Be-51 | 

Description 

sp: [LN:LEPH_STAAU] [AC:P72364] [GNrSPSA] [OR : STAPHYLOCOCCUS AUREUS] [DE : INACTIVE SIGNAL 
PEPTIDASE IA] [SP:P72364] [DB : swissprot] >gp : [GI :gl595809] [LN : SAU65000] [AC:U65000] 
[PN:type-I signal peptidase SpsA] [GN:spsA] [OR : Staphylococcus aureus] 
[DB :genpept-bct2] [DE : Staphylococcus aureus type-I signal peptidase SpsA (spsA) gene, 
andtype-I signal peptidase SpsB (spsB) gene, complete cds.] [NT: inactive signal 
peptidase homologue; protein lacks] [LE:40] [RE: 564] [DI: direct] 



NT AA 

QRF Name NT ID AAID — - — , , score probability 

Length Length £ - 



AI75u3000991_226g5887_c3_598 | [1976 | [5748 |p3~~] | 42 | | 120 |[ 2.4e-07 



Description 

pir: [LN:D69838] [AC:D69838 ] [PN : conserved hypothetical protein yisU] [GN:yisU ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83088 :g2633422] [LN : BSUB0006] 
[AC:Z99109:AL009126] [GN:yisU] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to hypothetical proteins] [LE: 165981] [RE: 166643] [DI : complement] 
>gp: [GI:ell73520:g2145387] [LN: BSY09476] [AC:Y09476] [PN:YisU] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis 54kb genomic DNA fragment.] [NT : putative] [LE:23207] 
[RE: 23869] [DI : complement ] 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



IAI7B03000991 22703588 F2 137 



TTTT 



| b749 || 954 ~| pi7 | pi~ | |6 . Oe-ff TT 



1.-] 



Description 

sp : [LN : MENA_BACSU] [AC:P39582] [GN : MENA : IPA-6D] [OR:BACILLUS SUBTILIS] [EC:2.5. 
[DE: (DHNA-OCTAPRENYLTRANSFERASE) ] [SP:P39582] [DB : swissprot] >pir: [LN:S39661] 
[AC:S39661:B70050 ] [PN : menaquinone biosynthesis protein homolog ywaB:protein ipa-6d] 
[GN : ywaB ] [CL:quinone biosynthesis homolog ywaB] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI:g413930] [LN : BSGENR] [AC:X73124] [GN:ipa-6d] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic region (325 to 333).] [SP:P39582] [LE:5904] 
[RE:6839] [DI:direct] >gp : [GI : ell86348 : g26363 84] [LN : BSUB002 0] [AC : Z99123 : AL009126] 
[GN : ywaB] [FN: unknown] [OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis 
complete genome (section 20 of 21) : from 3798401to 4010550.] [NT : alternate gene name: 
ipa-6d; similar to quinone] [SP:P39582] [LE:151120] [RE:152055] [DI : complement] 



ORF Name 



NTID 



1AI7503000991 23438827 c3 556 



TTTW 



Description 
NO-HIT 



tv^x^ NT AA 

AAID — — score 

LengthLength 

|5750 ||129 | |42 | 



probability 



ORF Name 



NT 



AA 



NTID AAID — , _ 

LengthLength 



score 



IAI7S03000991 23B3S910 cl 391 



j [1979 | [5751 |pn~|[200 Im- 



probability 
9.8e-48 



Description 

gp: [GI:g4185300] [LN : AF089862] [AC : AF089862 ] [PN:unknown] [OR : Staphylococcus carnosus] 
[DB:genpept-bct2] [DE : Staphylococcus carnosus type-I signal peptidase SipA (sipA) 
andtype-I signal peptidase SipB (sipB) genes, complete cds; andunknown gene.] [NTrOrfl; 
similar to Bacillus subtilis yhjE] [LE:271] [RE: 948] [DI: direct] 



464 



ORF Name 



NTID 



NT AA 
AAID — _ T — , , score 
Length Length 



probability 



|AI750300099i_23b93932_t2_166 

Description 

NO -HIT 



^ [1980 | [5752 ||126 | [41 | 



ORF Name 



NTID AAID , 



E 



7503000991 23611553 c3 600 



Description 
NO-HIT 

I ! lllu Wi 1 ll r n. " , =g 

ORF Name 



][ 



T93T 



NT AA 
LengthLength 
[5753 |P^~| 



score probability 



NT AA 
NTID AAID — — _ score 
LengthLength 



probability 



17503000991 23625637 c2 525 



Description 
NO-HIT 



j [1982 | [5754 |[TF3 | |6" 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



— . , score probability 



LAI7503000991 23634641 cl 398 



1983 | [5755 ||1248^ [415 | [1475 ||3 . 7e-151 



Description 

pir : [LN:G69842] [AC:G69842 ] [PN : 3 -oxoacyl - [acyl -carrier-protein] synthase,] [GNryjaY ] 
[CL: 3-oxoacyl- [acyl-carrier-protein] synthase I: 3-oxoacyl- [acyl-carrier-protein] 
synthase I homology] [OR:Bacillus subtilis] [EC : 2 . 3 . 1 . 41] [DB:pir2] 

>gp: [GI:ell83137 :g2633471] [LN :BSUB0006] [AC : Z99109 : AL009126 ] [GN:yjaY] [FNiunknown] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis complete genome (section 
6 of 21): from 999501 tol209940.] [NT:similar to 3-oxoacyl- acyl-carrier protein] 
[LE:209150] [RE:210391] [DI:direct] >gp : [GI : ell83154 : g2633488] [LN: BSUB0007] 
[AC:Z99110:AL009126] [GN:yjaY] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 7 of 21): from 1194391to 1411140.] 
[NT:similar to 3-oxoacyl- acyl-carrier protein] [LE:14260] [RE:15501] [DIrdirect] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IAI7503000991 23678800 c3 561 



[5756 |p^~|[132 |p^5" 



probability 
1.0e-36 



Description 

pir : [LN:H69841] [AC:H69841 ] [PN : conserved hypothetical protein yitW] [GN:yitW ] 
[CL: conserved hypothetical protein MJ1129] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:e261991:gl620928] [LN: BS168NPRB] [AC:Z79580] [GN:putative orf] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE : B . subtilis nprB gene.] [LE:6256] [RE:6564] 
[DI:direct] >gp: [GI :ell831 18 :g2 633452] [LN : BSUB0006] [AC : Z99109 : AL009126] [GN:yitW] 
[FN: unknown] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete 

genome (section 6 of 21) : from 999501 tol209940.] [NT: similar to hypothetical proteins] 
[LE:192227] [RE:192535] [DI:direct] >gp : [GI : ell73550 : g2145417] [LN : BSY09476] 
[AC:Y09476] [PN:YitW] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis 54kb 

genomic DNA fragment.] [NT:putative orf] [LE:49453] [RE:49761] [DI:direct] 



ORF Name 



NTID 



[A17503000991_2392300_i2__176 

Description 

NO-HIT 



AAID 



NT 



AA 



LengthLength 



score probability 



] |1985 [ [5757 ||138 1 145 | 



465 



ORF Name 

|AI750300099i_24022191J:2J>32 



NT 



AA 



NTID AAID L - thLe - h gcore probability 
[1986 | [5758 ||222 1 [73 [[37 || 3.9e-0b 



Description 

pir: [LN:G69854] [AC:G69854 ] [PN : hypothetical protein yj zD] [GNiyjzD ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell83130 :g2633464] [LN: BSUB0006] [AC : Z99109 : AL009126] 

[GNryjzD] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis 
complete genome (section 6 of 21): from 999501 tol209940.] [LE:204697] [RE:204882] 

[DI: complement] >gp: [GI : el 183 14 7 :g2 633481] [LN: BSUB0007] [AC : Z99110 : AL009126] [GN : yj zD] 

[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 7 of 21): from 1194391to 1411140.] [LE:9807] [RE:9992] [DI : complement] 
>gp: [GI:dl023798:g2564027] [LN:D86376] [AC:D86376] [OR:Bacillus subtilis] [SR:Bacillus 
subtilis (strain:CU741) DNA] [DB :genpept-bctl] [DE:Bacillus subtilis ArgF and med 
genes, partial and complete cds . ] [NT: unnamed protein product] [LE:1027] [RE: 1212] 

[DI : complement] 



ORF Name 



NTID 



NT 

AAID . — 



AA 



LengthLength 



score 



probability 



IAS7503000991 24024142 c2 496 



[1987 | [5759 |pi5~ 1 1104 |p ||0.000i0 



Description 

gp: [GI:g2654481] [LN : BFU89914 ] [AC:U89914] [PN : hypothetical 10.1 kDa protein] 
[OR:Bacillus firmus] [DB :genpept-bct2] [DE:Bacillus firmus hypothetical 34.0 kDa 
protein, hypothetical 8.9 kDaprotein, hypothetical 10.1 kDa protein, hypothetical 21.0 
kDaprotein, putative thiosulfate sulfurtransf erase, hypothetical 16.1kDa 
transcriptional regulator and hypothetical 18.2 kDa proteingenes , complete cds, and 
Na+/H+ antiporter homolog gene, partialcds.] [NT:orf4] [LE:1601] [RE:1882] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — __ — _ score 
Length Length 



probability 



IAI7503000991 24070137 c3 502 



J [1988 | [5760 ||150"~[|49 | 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000991 24105342 cl 448 



|1989 | [5751 |pT-||275 | [1242 ||i . 8e-125 



Description 

pir: [LN:F69656] [AC : F69656 : A42715 : S27512 ] [PN : naphthoate synthase, menB : DHNA 
synthase : dihydroxynaphthoate synthase: dihydroxynapthoic acid synthetase 
menB :menaquinone biosynthesis enzyme/enoyl CoA hydratase homolog] [GN:menB ] 
[CL: naphthoate synthase : enoyl- CoA hydratase homology] [OR:Bacillus subtilis] 
[EC:4.1.3.36] [DB:pir2] >gp : [GI : el 185953 :g2 635564] [LN : BSUB0016] [AC : Z99119 : AL009126] 
[PN: dihydroxynapthoic acid synthetase] [GN:menB] [FN:menaquinone biosynthesis] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [EC : 4 . 1 . 3 . 36] [DE:Bacillus subtilis complete 
genome (section 16 of 21): from 2997771to 3213410.] [LE: 150211] [RE:151026] 
[DI: complement] >gp: [GI : g2293148] [LN: AF008220] [AC : AF008220] [PN : dihydroxynaphthoate 
synthase] [GN:menB] [OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis 
rrnB-dnaB genomic region.] [LE: 29401] [RE: 30216] [DI -.direct] 



NT AA 

ORF Name NTID AAID — — , , scor e probability 
LengthLength *- 



IAI7503000991 24225010 c2 536 



Description 
NO-HIT 



[1990 | [5762 || 168 ] [ 55 | 



466 



ORF Name 

|AI7503000991J*427045i_ci_432 



Description 
NO-HIT 



NTID 
"J [1991 



NT AA 
AAID — . _ — , . score 
LengthLength 

[5753 [|135 | [44 [ 



probability 



ORF Name 



NT 

NTID AAID . 



AA 



LengthLength 



score 



IAI7503000991 24409428 ci 438 



J [1992 [ [5754 jpsa |p [ p 



probability 
i.ie-32 



Description 

gp: [GI:gl731452] [LN : SAU38429] [AC:U38429] [PN : recombination protein] [GN:pre] 
[OR: Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus chloramphenicol 
resistance plasmid pKH7 , complete sequence.] [LE:2074] [RE: 3285] [DI: direct] 



ORF Name 



NTID 



^™ NT AA 

AAID — _ T — . . score 

LengthLength 



probability 



IAI7503000991 24412517 c3 581 



[5755 |pi~||130 [|213 ||1.2e-16 



Description 

sp: [LN:G6PD_BACSU] [AC:P54547] [GN:ZWF] [OR:BACILLUS SUBTILIS] [EC : 1 . 1 . 1 .49] 
[DE: PROTEIN 11) (VEG11) ] [SP:P54547] [DB : swissprot] >pir : [LN : B69964 ] [AC:B69964 ] 
[PN: probable glucose- 6 -phosphate 1- dehydrogenase, yqjJ] [GN:yqjJ ] 

[CL:glucose-6-phosphate dehydrogenase] [OR:Bacillus subtilis] [EC : 1 . 1 . 1 . 49] [DB:pir2] 
>gp: [GI:dl013296:gl303961] [LN : BACJH642] [AC : D84432 : D82370] [PN:YqjJ] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] 
[DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 239025] 
[RE:240494] [DI : complement] >gp: [GI : ell85654 :g2634820] [LN: BSUB0013] 

[AC:Z99116:AL009126] [GN:yqjJ] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT: similar to glucose- 6 -phosphate 1 -dehydrogenase] [SP:P54547] [LE: 83125] [RE: 84594] 
[DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



AI7503000991 24415632 cl 393 



[5755 |[T7^~||192 |p" 



probability 
1.7e-82 



Description 

Sp: [LN : LEP_STAAU] [AC:P72365] [GN:SPSB] [OR : STAPHYLOCOCCUS AUREUS] [EC : 3 . 4 . 21 . 89] 
[DE : SIGNAL PEPTIDASE IB, (SPASE IB) (LEADER PEPTIDASE IB)] [SP:P72365] [DB : swissprot] 
>gp: [GI:gl595810] [LN : SAU65000] [AC:U65000] [PN:type-I signal peptidase SpsB] [GN:spsB] 
[OR staphylococcus aureus] [DB : genpept -bct2 ] [DE : Staphylococcus aureus type-I signal 
peptidase SpsA (spsA) gene, andtype-I signal peptidase SpsB (spsB) gene, complete cds . ] 
[NT: signal peptidase, leader peptidase, serine] [LE:580] [RE: 1155] [DI: direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



^7503000991 24415702 12 125 



J 



probability 
[5767 || 4005 1| 1335 [ |7001 || 0.0 



] 



Description 

gp: [GI:g2267242] [LN : SEU71377] [AC:U71377] [PN : autolysin AtlE] [OR : Staphylococcus 
epidermidis] [DB : genpept-bctl] [DE : Staphylococcus epidermidis autolysin AtlE and 
putativetranscriptional regulator AtlR genes, complete cds.] [NT: primary attachment to 
a polystyrene surface] [LE:2620] [RE:6627] [DI:direct] 



ORF Name 



NTID 



F 



17503000991 24428543 13 297 



J 



AAID 



NT 



AA 



LengthLength 



[5768 |[T71 | [5T" 



score probability 
uT0T7 



7T 



Description 

gp: [GI:gl53715] [LN : STRMLIKEE] [AC:L05021] 
pyogenes] [SR : Streptococcus pyogenes DNA] 
M-like protein gene, partial cds.] [LE:<1] 



[PN:M-like protein] [OR : Streptococcus 
[DB: genpept-bctl] [DE : Streptococcus pyogenes 
[RE:>243] [DI:direct] 



467 



ORF Name 

|Ai7503000991_2447b252_i:3_289 



NT 



AA 



Description 
NO-HIT 



^ MID L - thLe - th BC°re probability 
[1997 | [5759 |P^~] \ 52 | 



ORF Name 



NT ID 



NT AA 
AAID — , — , , score 
LengthLength 



AI7503000991 24646886 C3 594 



[5770 |p^T"[ |5i8 |p^" 



probability 
2.0e-23 



Description 

pir: [LN:B71973] [AC:B71973 ] [PN: 2 ■ , 3 1 -cyclic-nucleotide 2 ' -phosphodiesterase] [GNicpdB 
] [CL:2 1 , 3 • -cyclic-nucleotide 2 ' -phosphodiesterase : 2 ' , 3 ' -cyclic-nucleotide 
2 ■ -phosphodiesterase homology iphosphoest erase core homology] [OR : Helicobacter pylori] 
[SR:strain J99, , strain J99] [SR:strain J99, ] [DB:pir2] >gp : [GI : g4154615] 
[LN:AE001449] [AC : AE001449 :AE001439] [PN:2 1 , 3 1 -CYCLIC-NUCLEOTIDE 2 1 - PHOSPHODIESTERASE] 
[GN : cpdB] [OR Helicobacter pylori J99] [DB : genpept -bct2 ] [DE : Helicobacter pylori, 
strain J99 section 10 of 132 of the completegenome . ] [NT: similar to H. pylori 26695 
gene HP0104] [LE:139] [RE: 1884] [DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID — — 
LengthLength 



IAI7503000991 24648412 cl 406 



|5771 ||542 



2TT 



score probability 
683 ||3.1e-67 



Description 

pir: [LN:E69844] [AC:E69844 ] [PN:GTP pyrophosphokinase homolog yjbM] [GNtyjbM ] [CL:GTP 
pyrophosphokinase related protein] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell83l80:g2633514] [LN : BSUB0007] [AC : Z99110 : AL009126] [GN:yjbM] [FN:unknown] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NT:similar to GTP pyrophosphokinase] [LE:42087] 
[RE:42722] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — ^,_ T — , , sco re 
LengthLength 



probability 



F 



I750300099I 24745437 c3 539 



] [2000 | [5772 |pSTj|420 | p^-||i . 2e-170 



Description 

sp: [LN:YPCAJBACSU] [AC:P50735] [GN : YPCA] [OR: BACILLUS SUBTILIS] [EC:1.4.1.-] [DE:(EC 
1.4.1.-)] [SP:P50735] [DB : swissprot ] >pir : [LN :G69933] [AC:G69933 ] [PN : glutamate 
dehydrogenase homolog ypcA] [GN : ypcA ] [CL : glutamate dehydrogenase (NAD(P)+)] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83741 :g2634714] [LN: BSUB0012] 
[AC:Z99115:AL009126] [GN:ypcA] [FNrunknown] [OR:Bacillus subtilis] [DB : genpept -bet 1] 
[DE:Bacillus subtilis complete genome (section 12 of 21): from 2195541to 2409220.] 
[NT:similar to glutamate dehydrogenase] [SP:P50735] [LE:205764] [RE:207044] 
[DI: complement] >gp: [GI :ell85565:g2 634731] [LN : BSUB0013] [AC : Z99116 : AL009126] [GN: ypcA] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept -be tl] [DE:Bacillus subtilis complete 
genome (section 13 of 21): from 2395261to 2613730.] [NT:similar to glutamate 
dehydrogenase] [SP:P50735] [LE:6044] [RE:7324] [DI : complement] >gp : [GI : gll46206] 
[LN : BACSERA] [AC:L47648] [PN : glutamate dehydrogenase] [GN :ypcA] [OR:Bacillus subtilis] 
[DB : genpept -bct2] [DE:Bacillus subtilis phosphoglycerate dehydrogenase (serA) , 
ypaA, ferredoxin (fer) , ypbB, recS, ypbD, ypbE, ypbF, ypbG, ypbH, glutamate dehydrogenase 
(ypcA) , ypdA, ypdB, ypdC, spore cortexlytic enzyme (sleB) , ypeB, ypfA, ypfB, cytidine 
monophosphatekinase (cmk) , ypfD, ypgA, yphA, yphB, yphC, NAD+ 

dependentglycerol- 3 -phosphate dehydrogenase (glyc) , yphE and yphF genes, complete cds.] 
[NT: similar to Clostridium difficile glutamate] [LE:9353] [RE: 10633] [DI: direct] 



468 



NT AA 

ORF Name NT ID AAID — _ T — _ score probability 
Length Length e 

|AI7503000991_24804(b77_c3_b97 | | 2001 | [5773 1 | pT7~[| l . 5e-06 

Description 

gp: [GI:g2541936] [LN : PSU2 7310] [AC:U27310] [PN:unknown] [OR : Pseudomonas syringae] 
[DB:genpept-bct2] [DE : Pseudomonas syringae phaseolotoxin gene cluster, complete 
sequence.] [NT : ORF6 ; similar to Pseudomonas syringae fatty acid] [LE:5079] [RE: 6062] 
[DI: direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



|&I75030009Si_25392fl25_t2_i75 | |20u2 | [5774 || ii4 5"| | 361 | pT7~| |7 . 0e-b4 — | 

Description 

pir: [LN:G70007] [AC:G70007 ] [PN : conserved hypothetical protein yueF] [GN:yueF ] 
[CL: Bacillus subtilis conserved hypothetical protein yueF] [OR: Bacillus subtilis] 
[DB:pir2] >gp: [GI : ell84257 :g2635675] [LN: BSUB0017] [AC : 299120 : AL009126] [GN : yueF] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete 

genome (section 17 of 21): from 3197001to 3414420.] [NT:similar to hypothetical 

proteins] [LE: 64687] [RE: 65796] [DI : complement] 

NT AA 

ORF Name NTID AAID t — ^ T — _ score probabil ity 
Length Length c dL 



|A17b03000991_2S429S12_c3_593 | [2003 | [5775 |p52"| | 463 | [TTT0-|| 1 . 3e-114 

Description 

pir : [LN:H69862] [AC:H69862 ] [PN : Na+- transporting ATP synthase homolog ykrM] [GN:ykrM ] 
[CL:Na+-ATP synthase chain J] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell84940:g2633721] [LN : BSUB0008] [AC : Z99111 : AL009126] [GN:ykrM] [FN:unknown] 
[OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 

8 of 21): from 1394791to 1603020.] [NT:similar to Na+- transporting ATP synthase] 
[LE:20743] [RE:22092] [DI:direct] 



NT AA 

ORF Name NTID AAID Le ~ tnLe ~ tn score probability 



Ar7b03000991_25430316_c2_522 [ [2004 | [5776 j[T7T~] J55 | p771~|| 5 . 6e-li — [ 



Description 

pir: [LN: QQSA8T] [AC:A04493 ] [PN : hypothetical protein B-189] [OR : Staphylococcus aureus] 
[DB:pirl] 

NT AA 

ORF Name NTID AAID — _ — , , score pro bability 

LengthLength L 



A17603000991_26178400_c3_B42 | [ 2005 | [ 5777 || 186 1 1 61 ||7I 1| 0 . 042 



Description 

Sp : [ LN : AOPP_HUMAN] [AC:P30044] [OR: HOMO SAPIENS] [SR:,HUMAN] [DE:71B)] [SP:P30044] 
[DB:swissprot] 

NT AA 

ORF Name NTID AAID — , — , score probability 
LengthLength 

|A17b03000991_26212787J:3_33B | [2005 | [ 5778 |p2~| | 203 | pu^~|| l . 7e-27 — | 

Description 

pir: [LN:C69844] [AC:C69844 ] [PN : hypothetical protein yjbK] [GN:yjbK ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : el 183178 :g2 633512] [LN : BSUB0007] [AC : Z99110 : AL009126] 

[GN:yjbK] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [LE:40993] [RE:41565] 

[DI : complement] 



469 



ORF Name 



NTID 



NT 



AA 



AAID . m „ 



LenqthLength 



score 



probability 



|AI7503000991_26377J40__c2_488 



] [2007 | [5779 ||999 | [332 | pW~[|4 . 5e-15 



Description 

gp: [GI:e257629:gl771202] [LN: LLLVSFPEP] [AC:X99710] [PN : transcription factor] 
[ORrLactococcus lactis] [DB : genpept-bctl] [DE:L.lactis ORF, genes homologous to vsf-1 
and pepF2 and gene encodingprotein homologous to methyl transf erase . ] [NT: weak homology 
with vsf-1 gene (X73635)] [LE:934] [RE: 1917] [DI: direct] 



ORF Name 



UC7S03000S91 26564012 c2 477 



NTID 



score 



NT AA 

AAID 

Length Length 

] [2008 | [5780 ipi^j pi ||2649 |fT 



probability 
5e-275 



Description 

gp: [GI:g3150046] [LN : AF016634] [AC : AF016634 ] [PN:ClpB chaperone homolog] [GN:clpB] 
[OR:Lactococcus lactis subsp . cremoris] [DB : genpept-bct2] [DE : Lactococcus lactis 

cremoris ClpB chaperone homolog (clpB) andphosphoribosylf ormylglycinamide cyclo-ligase 
(pur5) genes, completecds; and phosphor ibosylglycinamide formyltransf erase (pur3) 

gene,partial cds.] [LE:183] [RE:2786] [DI:direct] 



ORF Name 



|AI7503 000991_26 56 7557_C3_5b8 



Description 
NO-HIT 



NTID 



„ T „ NT AA 

AAID — , T — , , score 

LengthLength 



5781 11138 I Im- 



probability 



ORF Name 



NT AA 
NTID AAID — ^. — score 
Length Length 



probability 



pnrr 



S03000991 26604662 c2 495 



Description 
NO-HIT 



] |20io | |5782 ||630 | pW 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



1AI7503000991 26775637 ci 447 



][ 



[5783 |p55-||462 | p5~ ||2 . 5e-58 



Description 

sp : [LN : MENF_BACSU] [AC : P23973 : P23972 ] [GN : MENF : ICSM] [OR:BACILLUS SUBTILIS] 
[EC:5.4.99.6] [DE : MENAQUINONE- SPECIFIC ISOCHORISMATE SYNTHASE,] [SP : P23973 : P23972] 
[DB:swissprot] >pir : [LN : A69657] [AC :A69657 : S27507 : S27508 : 139883 ] [PN:probable 
isochorismate synthase, menaquinone- specif ic menF] [GN:menF ] [CL : isochorismate 
synthase] [OR:Bacillus subtilis] [EC : 5 . 4 . 99 . 6] [DB:pir2] >gp : [GI : ell85956 :g2635567] 
[LN:BSUB0016] [AC : Z99119 : AL009126] [PN : menaquinone- specif ic isochorismate synthase] 
[GN:menF] [FN : menaquinone biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[EC:5.4.99.6] [DE:Bacillus subtilis complete genome (section 16 of 21): from 2997771to 
3213410.] [NT:alternate gene name: icsM, entC] [SP:P23973] [LE:153612] [RE:155027] 
[DI: complement] >gp: [GI :g2293145] [LN : AF008220] [AC : AF008220] [PN : isochorismate 
synthase] [GN:menF] [OR:Bacillus subtilis] [DB :genpept-bct2] [DE:Bacillus subtilis 
rrnB-dnaB genomic region.] [LE:25400] [RE:26815] [DI:direct] >gp : [GI : gll85288] 
[LN : BACMENAQOP] [AC :M74538 :M74182 :M74183] [PN : isochorismate synthase] [GN:menF] 
[OR:Bacillus subtilis] [SR:Bacillus subtilis (strain RBI) DNA] [DB : genpept - be 1 2 ] 
[DE: Bacillus subtilis menaquinone operon: menF, menD, menB and menEgenes, complete 
cds.] [NT:based on similarity to E. coli EntC, A. hydrophila] [LE:143] [RE: 1558] 
[DI : direct] 



ORF Name 



NT AA 
NTI D AAID — _ — t score 
LengthLength 



probability 



AI7503000991_2848308_c3_607 

Description 

NO-HIT 



j [2012 | [5784 |44 | 



470 



ORF Name 



NT ID 



NT AA 
AAID — — , , score 
LengthLength 



2013 | [5785 |pW|p |p~ 



probability 
10.0019 



] 



|A.L750300099i_289802J:2_227 
Description 

pir: [LN:F22845] [AC:F22845 ] [PN : hypothetical protein 6] [OR : mitochondrion Trypanosoma 
brucei] [DB:pir2] 



ORF Name 



NTID 



NT AA 
AAID — — , , score 
LengthLength 



probability 



^17503000991 2922250 c3 574 



2014 | [ 5785 |pgT"| |127 | |I73~| |2 . 7e-13 



Description 

pir: [LN:D69844] [AC:D69844 ] [PN : hypothetical protein yjbL] [GN.-yjbL ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell83179 :g2633513] [LN: BSUB0007] [AC : Z99110 : AL009126] 
[GN:yjbL] [FNrunknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [LE:41690] [RE:42058] 
[DI: direct] 



ORF Name 



|A17b03000991 29335888 c3 562 



Description 
NO-HIT 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



] [2015 | [5787 ||135""||44 | 



ORF Name 



NTID 



AAID 



^17503000991 29570302 tl 24 



Description 
NO-HIT 



][ 



NT AA 
LengthLength 
[5788 ||T32 1 143 



score probability 



ORF Name 



NT AA 
NTID AAID — , — , score 
LengthLength 



probability 



^17503000991 31281253 c3 538 



][ 



TuTT 



[5789 |p?T]|406 | [1513 ||3 . 5e-155 



Description 

sp: [LN:OAT_BACSU] [AC:P38021] [GN : ROCD] [OR:BACILLUS SUBTILIS] [EC : 2 . 6 . 1 . 13] 
[DE : AMINOTRANSFERASE) ] [SP:P38021] [DB : swissprot ] >pir: [LN:S55793] 

[AC:S55793:S49267:B53370:H69693 ] [PN : ornithine- -oxo-acid transaminase, rocD : ornithine 
aminotransf erase : ornithine- -oxo-acid aminotransferase] [GN : rocD ] 

[CL: ornithine- -oxo-acid aminotransferase] [OR: Bacillus subtilis] [EC : 2 . 6 . 1 . 13] 

[DB:pir2] >gp: [GI : dl011955 :gl064807] [LN : BACGNTZA] [AC:D78193] [PN : orthinine 
aminotransferase] [GN : rocD] [OR: Bacillus subtilis] [SR: Bacillus subtilis (strain: 168) 
DNA] [DB : genpept-bctl] [DE:Bacillus subtilis 36kb sequence between gntZ and trnY 
genesencoding 34 ORFs . ] [LE:25845] [RE:27050] [DI : complement] >gp : [GI : g550311] 

[LN : BSROCDEF] [AC:X81802] [PN : ornithine< aminotransferase] [GN : rocD] [OR:Bacillus 
subtilis] [DB : genpept-bctl] [DE :B . subtilis rocD, rocE and rocF genes.] [SP:P38021] 

[LE:241] [RE:1446] [DI: direct] >gp: [GI :ell84760:g2 636581] [LN: BSUB0021] 

[AC:Z99124 :AL009126] [PN : ornithine aminotransferase] [GN : rocD] [FN:arginine and 
ornithine utilization] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 2 . 6 . 1 . 13] 

[DE:Bacillus subtilis complete genome (section 21 of 21): from 3999281to 4214814.] 

[SP:P38021] [LE:144229] [RE:145434] [DI : complement ] 



ORF Name 



NT AA 
NTI D AAID — , — , score 
LengthLength 



probability 



AI7503000991_31300807_cl_436 

Description 

NO-HIT 



] [2018 | [5790 1|51 | 



471 



ORF Name 



NT AA 
NTID AAID — , — x , scor e 
LengthLength 



probability 



E 



17b0300099i 3ib2b260 c3 579 



Description 
NO-HIT 



2 [2019 | [5791 |p3Y~~| I 44 | 



ORF Name 



NTID 



AAID 



^7503000991 32062875 ci 439 



is 



NT AA 
Length Length 



score 



I F IPH f 



probability 
3.8e-30 



Description 

gp: [GI :gl731452] [LN: SAU38429] [AC:U38429] [PN : recombination protein] [GN:pre] 
[OR: Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus chloramphenicol 
resistance plasmid pKH7 , complete sequence.] [LE:2074] [RE: 3285] [DI: direct] 



ORF Name 



NTID 



NT 

AAID _ 



AA 



LengthLength 



score probability 



IA17503000991 32067937 t2 247 



[2021 | [ 5793 |pT5T]| 454 | [1790 || 1 . be-184 
Description 

gp: [GI:g2792490] [LN : AF041467] [AC : AF041467] [PN: coenzyme A disulfide reductase] 
[FN:pyridine nucleotide disulfide oxidoreductase] [OR: Staphylococcus aureus] 
[DB:genpept-bct2] [DE : Staphylococcus aureus coenzyme A disulfide reductase gene, 
completecds . ] [NT : CoADR ; flavoenzyme (FAD); single catalytic cysteine] [LE:66] 
[RE: 1382] [DI:direct] 



ORF Name 



LAI7503000991 32132183 ci 455 



Description 
NO-HIT 



NT AA 
NTID AAID — , — , score 
LengthLength 

[2022 | [ 5794 |p2~| |43 | 



probability 



ORF Name 



NTID 



NT 



AA 



1AI7503000991 32457312 c2 501 



AAID — 

LengthLength 

[5795 |[^~| 



score 



probability 



i.5e-19 



Description 

gp: [GI:e303881:gl850807] [LN:CPCPEAA] [AC:X71844] [PN:putative transposase] 
[OR: Clostridium perfringens] [DB : genpept-bctl] [DE : C . perf ringens uapC, cpe, and nadC 
genes.] [LE:2477] [RE:2932] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



IAI7503000991 32635937 cl 429 



"j [2024 | [5796 |p^uT] |500 | 



probability 
14.1e-232 



Description 

gp: [GI:el301684 :g3256223] [LN : SAY14370] [AC:Y14370] [PN : UDP-N-acetylmuramyl - tripeptide 
synthetase] [GN:murE] [OR : Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus 
aureus RF3, murE, ypfP genes.] [LE:3244] [RE: 4722] [DI : complement] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score 



probability 



117503000991 33364067 cl 431 



[2025 | [579 7 | p^T] [ 593 || 542 ||2.7e-b2 1 

Description 

gp: [GI:g2109443] [LN : SPDNAARG] [AC : AF000658] [PN:putative serine protease] [GN:sphtra] 
[OR: Streptococcus pneumoniae] [DB : genpept-bct2] [DE : Streptococcus pneumoniae R801 
tRNA-Arg gene, partial sequence, andputative serine protease (sphtra) , SPSpoJ (spspoJ) , 
initiatorprotein (spdnaa) and beta subunit of DNA polymerase III (spdnan) genes , 
complete cdsj [NT:SPHtra] [LE:765] [RE: 1958] [DI: direct] 



472 



ORF Name 



NT AA 
NT ID AAID — — , , score 
LengthLength 



probability 



U 7 503000991 3371067 c2 482 



] [2025 | |5798 — 1 1317 | p3T] |2 . 5e-104 



] 



Description 

pir: [LN:E38447] [AC : E38447 : S15234 : C6 9669 ] [PN : oligopeptide transport ATP-binding 
protein oppF : sporulation initiation protein spoOKE] [GN: spoOKE :oppF ] [CL: inner 
membrane protein malK: ATP-binding cassette homology] [OR: Bacillus subtilis] [DB:pir2] 
>gp: [GI:gl43608] [LN : BACSPOOK] [AC:M57689] [PN : sporulation protein] [GN : spoOKE] 

[OR:Bacillus subtilis] [SR : B . subtilis (strain JH642) DNA, clones pDR20/21, pJL2/3 and 
pJL7] [DB:genpept-bctl] [DE : Bacillus subtilis spoOK operon.] [LE:5152] [RE: 6078] 

[DI: direct] 



ORF Name 



NT ID 



7V7VTr , NT AA 

AAID — , — .score 

LengthLength 



E 



17503000991 337109S8 t2 124 



|5799 [j¥53 [ jT^T 



7TT 



probability 
i.7e-73 



Description 

gp: [GI:g2267241] [LN : SEU71377] [AC:U71377] [OR : Staphylococcus epidermidis] 
[DB :genpept-bctl] [DE : Staphylococcus epidermidis autolysin AtlE and 

putativetranscriptional regulator AtlR genes, complete cds . ] [NT : ORF3] [LE:1957] 
[RE:2379] [DI :direct] 



ORF Name 



LAI7503000991 3394540 t2 138 



NT AA 
NT ID AAID — — score 
LengthLength 

[2028 | |5800 |pT2 | |103 



probability 



Description 
NO-HIT 

ORF Name 



NT ID 



_ _ __ NT AA 

AAID — 4_ i_ t — , , score 

Length Length 



probability 



IAI7503000991 34006551 t3 312 



[5801 j|T5u | 



Description 
NO-HIT 



ORF Name 



NT ID AAID — — __ score 
LengthLength 



IAI7503000991 34062930 cl 440 



TuTO" 



5802 11273 



probability 
i.4e-25 



Description 

sp : [LN : PRE3_STAAU] [AC:P03864] [OR : STAPHYLOCOCCUS AUREUS] [DE : PLASMID RECOMBINATION 
ENZYME (MOBILIZATION PROTEIN) ] [SP:P03864] [DB : swissprot] >gp : [GI :gl51683] [LN:PT1CG] 
[AC: JO 1764 : J01765] [OR:Plasmid pT181] [SR:Plasmid pT181 DNA from Staphylococcus aureus] 
[DB:genpept-bctl] [DE:Plasmid pT181, complete genome.] [NT:Pre protein (plasmid 
recombination)] [LE:2521] [RE: 3762] [DI: direct] 



ORF Name 



NT 



AA 



NTID AAID — — , , score prob ability 
LengthLength -L 



1AI7503000991 34073552 c2 509 



][ 



2031 



[5803 |piT||5i4 |p5T]|i.3e-237 



Description 

gp: [GI :el301683 :g3256222] [LN : SAY14370] [AC:Y14370] [PN:peptide chain release factor 3] 
[GN:RF3] [OR : Staphylococcus aureus] [DB : genpept-bctl] [DE : Staphylococcus aureus RF3 , 
murE, ypfP genes.] [LE:1435] [RE:3000] [D I : complement] 



473 



ORF Name 



NT ID 



NT AA 
AAID — _ T — score 
LengthLength 



1A1 7 503000991 341635S2 t3 322 



"2TOT" 



[ 5804 | [TOS — 1 1 161 |pT- 



probability 
1.0e-29 



Description 

pir: [LN:D69583] [AC:D69583 ] [PNralcohol dehydrogenase, adhB] [GN:adhB ] [CL:alcohol 
dehydrogenase : long- chain alcohol dehydrogenase homology] [OR:Bacillus subtilis] 
[EC:1. 1.1.1] [DB:pir2] >gp : [GI : e209890 :g2108273] [LN : BS233DEG] [AC : X92868 :X79978] 
[ PN : NAD alcohol dehydrogenase] [GN:adhB] [OR: Bacillus subtilis] [DB : genpep t - be 1 1 ] 
[DE :B. subtilis 23.9kb fragment from map position 233 degrees on thechromosome . ] 
[LE:13725] [RE:14861] [DI : complement] >gp: [GI : ell83926 :g2635142] [LN:BSUB0014] 
[AC:Z99117:AL009126] [PN:alcohol dehydrogenase] [GN:adhB] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis complete genome (section 14 of 21) : from 
2599451to 2812870.] [LE:153270] [RE:154406] [DI: direct] 



ORF Name 



XTmTT , NT AA 

NT ID AAID — . — , , score 

LengthLength 



probability 



M750300099i_34179031_t2_122 

Description 

NO-HIT 



2033 | [5805 |pT5~| | 44 | 



ORF Name 



NT AA 
NT ID AAID — , — ^ score 
LengthLength 



probability 



117503000991 3439850b t3 354 



Description 
NO-HIT 



] [2034 | [5805 ||232~| |73 | 



ORF Name 



^7503000991 34407888 c3 572 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



[5807 ||729~[|242 ||288 ||i.9e-37 



Description 

gp: [GI:g2952027] [LN : BFU88888] [AC:U88888] [PNrMecA homolog] [GN:mecA] [OR:Bacillus 
firmus] [DB :genpept-bct2] [DE: Bacillus firmus MecA homolog (mecA) and cardiolipin 
synthase (cls)genes, complete cds . ] [LE:349] [RE:1002] [DI: direct] 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



— . , score probability 



IAI7503000991 35158177 cl 408 



[2035 | [5808 |puT] |468 ||903 ||i.5e-90 



Description 

gp: [GI:dl039121:g4514345] [LN : AB013374] [AC : AB013374] [PN:Ykok] [GN:ykok] [ORrBacillus 
halodurans] [SR:Bacillus halodurans (strain : C-125) DNA] [DB : genpept-bctl] [DE:Bacillus 
halodurans C-125 mamX, yjdA, ykoK and yvfK genes, partialand complete cds.] [LE:1793] 
[RE: 3142] [DI : complement] 



ORF Name 



NTID 



NT 

AAID 



AA 



LengthLength 



score 



IAI7503000991 35937827 c2 505 



2037 



[5809 ||5TCr"]|169 ||3TT 



probability 
3.3e-31 



Description 

pir: [LN:G69846] [AC:G69846 ] [PN : hypothetical protein yjcG] [GN:yjcG ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI :ell83205:g2 633539] [LN: BSUB0007] [AC : Z99110 : AL009126] 

[GN:yjcG] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [LE:61943] [RE:62458] 

[DI : complement] 



474 



ORF Name 



IAI7503000991 361281*87 c3 621 



Description 
NO-HIT 



NTID 



NT AA 
AAID — — , , score 
LenqthLength 



probability 



J [2038 | [5810 ||155 | |55 [ 



ORF Name 



NTID 



NT 



AA 



AAID — _ — , , score probabili ty 
LengthLength c 



|AI750300099i_36i347i7_ci_40i 



2035 | [5911 |pTT||548 | [g^S - 1|4 . 6e-89 



Description 

pir: [LN:A53310] [AC:A53310 ] [PN: pheromone cADl binding protein precursor : TraC] 
[GNitraC ] [CL : dipeptide transport protein] [OR : Enterococcus faecalis] [DB:pir2] 
>gp: [GI:g388269] [LN : AD1TRAC ] [AC:L19532] [GN:traC] [ORrPlasmid pADl] [SR : Plasmid pADl 
DNA] [DB:genpept-bctl] [DE: Plasmid pADl (from Enterococcus faecalis strain: DS16) 
hemolysinbacteriocin (traC) gene, complete cds, traA and traB genes, 3' end.] [LE:166] 
[RE: 1797] [DI:direct] 



ORF Name 



NTID 



NT 



AA 



^ LengthLength probability 



IAI7503000991 36229625 F3 295 



] [2040 | [5812 |p^||67 | p5~||l . 6e-15 



Description 

pir: [LN:B69869] [AC:B69869 ] [PN : hypothetical protein ykvS] [GN:ykvS ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell84971 :g2633752] [LN: BSUB0008] [AC : Z99111 : AL009126] 
[GN:ykvS] [FN: unknown] [OR: Bacillus subtilis] [DB : genpept - be t 1 ] [DE: Bacillus subtilis 
complete genome (section 8 of 21): from 1394791to 1603020.] [LE:52338] [RE:52769] 
[DI : complement] 



ORF Name 



^TT 

NTID AAID — . _ — , , score 
LengthLength 



probability 



IAI7503000991 36611062 c3 557 



] [2041 | [5813 |pu~||139 | |X7¥"-j[2 . 7e-13 



Description 

pir : [LN:D69837] [AC:D69837 ] [PN : hypothetical protein yisL] [GN : yisL ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell83078 :g2633412] [LN: BSUB0006] [AC : Z99109 : AL009126] 
[GN:yisL] [FN:unknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 6 of 21): from 999501 tol209940.] [LE: 153237] [RE: 153593] 
[DI:direct] >gp: [GI : ell73510 :g2145377] [LN : BSY09476] [AC:Y09476] [PN : YisL] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE : B . subtilis 54kb genomic DNA fragment.] [NT : putative] 
[LE: 10463] [RE: 10819] [DI:direct] 



ORF Name 



NTID AAID 



NT 



AA 



LengthLength 



score probability 



AI7503000991_37777__cl_434 

Description 

NO-HIT 



7uT2~ 



[5814 |55 | 



475 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score probability 



IAI7503000991 3954718 ci 394 



[2043 | [ 5815 | |3489||1162 | p5T]| 5 . 7e- 136 

Description 

sp: [LN:ADDB_BACSU] [AC:P23477] [GN : ADDB] [OR:BACILLUS SUBTILIS] [DE : ATP- DEPENDENT 
NUCLEASE SUBUNIT B] [SP:P23477] [DB : swissprot] >pir : [LN : A39432] 
[AC:A39432 :S61272 :A69583 ] [PN : ATP-dependent deoxyribonuclease chain B 
addB: ATP -dependent exonuclease synthesis protein AddB] [GN:addB ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : gl42439] [LN : BACADDAA] [AC:M63489] [PN : ATP -dependent 
nuclease] [GNraddA] [ORrBacillus subtilis] [SR :B . subtilis DNA] [DB :genpept-bctl] 
[DE:Bacillus subtilis ATP-dependent nuclease (addA) and (addB), andopen reading frame 
3, partial cds . ] [LE:502] [RE:4002] [DI: direct] >gp : [GI : ell83064 : g2633398] 
[LN:BSUB0006] [AC : Z99109 : AL009126] [PN: ATP-dependent deoxyribonuclease (subunit B) ] 
[GNraddB] [ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 6 of 21): from 999501 tol209940.] [SP:P23477] [LE:136293] [RE:139793] 
[DI:direct] >gp: [GI : el364880 :g2226191] [LN : BSY14081] [AC:Y14081] [PN : ATP-dependent 
nuclease] [GN:addB] [OR: Bacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis 
chromosomal DNA, region 92 degrees: regionbetween comK and addAB.] [NT : TTG start; see 
ref [3]; In EMBL entry M6 348 9 this] [LE:18588] [RE:22088] [DI:direct] 



ORF Name 



AI7503000991 4019193 cl 418 



NT ID 



12044 



^ m NT AA 

AAID — , » _ — , . score 

LengthLength 



|5816 |[7F5 I p^I 



probability 
3 . 2e-74 



Description 

pir: [LN:G69845] [AC:G69845 ] [PN : enoyl- [acyl-carrier-protein] reductase (NADH) , yjbW] 
[GN:yjbW ] [CL: enoyl- [acyl-carrier-protein] reductase (NADH): short-chain alcohol 
dehydrogenase homology] [OR:Bacillus subtilis] [EC: 1.3. 1.9] [DB:pir2] 

>gp: [GI:ell83192 :g2633526] [LN : BSUB0007] [AC : Z99110 : AL009126] [GN:yjbW] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NT:similar to enoyl- acyl-carrier protein 
reductase] [LE : 52836] [RE: 53645] [DI:direct] 



ORF Name 



NT ID 



AAID 



NT 



AA 



LengthLength 



score 



^17503000991 4070151 c2 472 



[2045 | [5817 |prr-||148 [[Im- 



probability 
2 .4e-12 



Description 

gp: [GI:ell73495:g2145362] [LN:BSY09476] [AC:Y09476] [ PN : AddA] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE :B. subtilis 54kb genomic DNA fragment.] [SP:P23478] [LE:<1] 
[RE: 702] [DI: direct] 



ORF Name 



NT ID 



„™ NT AA 

AAID — — _ score 

Length Length 



probability 



IAI7503000991 4080443 cl 409 



"j [2045 [ [5818 ||Ig5*"]|617 | [1574 ||1 . 2e-15i 



Description 

pir: [LN:A69845] [AC:A69845 ] [PN:Na+/H+ antiporter homolog yjbQ] [GN:yjbQ ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83184 :g2633518] [LN : BSUB0007] 
[AC:Z99110:AL009126] [GN:yjbQ] [FN:unknovm] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 7 of 21) : from 1194391to 1411140.] 
[NT:similar to Na+/H+ antiporter] [LE:45437] [RE:47281] [DI: direct] 



476 



NT AA 

QRF Name NTID AAID Le ~ thLe ^ t fr scor ^ probability 



[Al7503000991_40a8962_c2_b0a | [2047 | [ 5819 |p75~| [ 92 | |7& || 0 ,006b 

Description 

pir : [LN:A70008] [AC:A70008 ] [PN : hypothetical protein yueH] [GN:yueH ] [OR:Bacillus 
subtilis] [DB:pir2] >gp : [GI : ell84255 :g2635673] [LN: BSUB0017] [AC : Z99120 : AL009126] 

[GN:yueH] [FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 17 of 21) : from 3197001to 3414420.] [LE: 64085] [RE: 64333] 

[DI : complement] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



[AI750300099i_4094433J:iJ72 | [2048 | [5820 || 999 ~]| 332 | |IIgT] |2 . 5e-120 

Description 

sp: [LN:SYW_BACSU] [AC:P21656] [GN : TRPS] [ OR : BAC I LLUS SUBTILIS] [EC:6.1.1.2] 
[DE : (TRPRS) ] [SP:P21656] [DB : swissprot] >pir : [LN : YWBS] [AC : JT0481 : A3 2452 :E6 9726 ] 
[PN: tryptophan- -tRNA ligase, : tryptophanyl- tRNA synthetase] [GN:trpS ] 
[CL : tryptophan- -tRNA ligase] [OR:Bacillus subtilis] [EC:6.1.1.2] [DB:pirl] 

>gp: [GI:gl43786] [LN : BACTRPSA] [AC:M24068] [OR:Bacillus subtilis] [SR : B . subtilis 
(strain QB928) DNA, clone pTSQ2] [DB :genpept-bctl] [DE : B . subtilis trpS gene encoding 

tryptophanyl -tRNA synthetase , complete cds . ] [NT : tryptophanyl -tRNA synthetase (EC 

6.1.1.2)] [LE:171] [RE:1163] [DI: direct] >gp : [GI : ell83162 : g2633496] [LN : BSUB0007] 
[AC:Z99110:AL009126] [PN: tryptophanyl -tRNA synthetase] [GN:trpS] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [EC:6.1.1.2] [DE:Bacillus subtilis complete genome (section 7 of 21): 

from 1194391to 1411140.] [SP:P21656] [LE:23195] [RE:24187] [DI : complement] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 

[AI7S03000&91_40967_ci_407 | [2049 | [5821 |p8~| | 285 | p3 | |3.2e-49 | 

Description 

sp: [LN:YJBO_BACSU] [AC:031613] [GN : YJBO] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 31.5 
KD PROTEIN IN MECA-TENA INTERGENIC REGION] [SP:031613] [DB : swissprot] >pir : [LN : G69844] 
[AC:G69844 ] [PN : conserved hypothetical protein yjbO] [GN:yjbO ] [CL : conserved 
hypothetical protein HI0176] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell83182:g2633516] [LN : BSUB0007] [AC : Z99110 : AL009126] [GN:yjbO] [FN:unknown] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NT:similar to hypothetical proteins] [SP:031613] 
[LE:43604] [RE:44455] [DI: direct] 



NT AA 

QRF Name NTID AAID Le ~ thLe ~ th score probability 



AI7S0300099i_4100393_t3_379 | [2050 | [ 5822 |pTF] |404 | |TT5F]| 2 . 3e-149 



Description 

pir: [LN:B69589] [AC:B69589 ] [PN : argininosuccinate synthase argG] [GN:argG ] 
[CL:argininosuccinate synthase] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell84194:g2635410] [LN : BSUB0015] [AC : Z99118 : AL009126] [PN : argininosuccinate 
synthase] [GN:argG] [FN:arginine biosynthesis] [OR:Bacillus subtilis] [DB :genpept-bctl] 
[EC:6.3.4.5] [DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 
3013540.] [LE:217075] [RE:218286] [DI : complement] >gp : [GI : el 185818 :g2 635429] 
[LN:BSUB0016] [AC : Z99119 : AL009126] [PN : argininosuccinate synthase] [GNrargG] 
[FN:arginine biosynthesis] [OR:Bacillus subtilis] [DB :genpept-bctl] [EC:6.3.4.5] 
[DE:Bacillus subtilis complete genome (section 16 of 21): from 2997771to 3213410.] 
[LE:14435] [RE:15646] [DI : complement ] >gp : [GI :g2293242] [LN : AF008220] [AC : AF008220] 
[PN:arginine succinate synthase] [GN:argG] [OR:Bacillus subtilis] [DB :genpept-bct2] 
[DE:Bacillus subtilis rrnB-dnaB genomic region.] [LE: 164781] [RE: 165992] [DI: direct] 



477 



ORF Name 



NTID AAID 



AI750300099i_4I7J410_c2_504 



NT AA 
— , «^ _ — .score 
LengthLength 



20si | [ 5823 |rrar-||253 



probability 
13 . Oe-46 



Description 

pir: [LN:H69846] [AC:H69846 ] [PN : hypothetical protein yjcH] [GN : y j cH ] [ORrBacillus 
subtilis] [DB:pir2] >gp: [GI : ell83206 : g2633540] [LN: BSUB0007] [AC : Z99110 : AL009126] 

[GN : yj cH] [FNiunknown] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21): from 1194391to 1411140.] [LE:62495] [RE:63217] 

[DI : complement] 



ORF Name 



NT 



AA 



NTID AAID , — 



LengthLength 



score 



|&17503000991_415i3i2_c2_527 



[2TT5T 



probability 
[5824 ||606 ||201 ~| |93 ||0.0i0 



Description 

pir: [LN:A71455] [AC:A71455 ] [PN:probable acetyltransf erase] [GN:PH0296 ] 
[OR:Pyrococcus horikoshii] [DB:pir2] >gp : [GI : dl030311 :g3256685] [LN : AP000001] 
[AC:AP000001 :AB009465 :AB009464 :AB009466 :AB009467 :AB009468 :AB00946 9] [PN: 172aa long 
hypothetical acetyltransf erase] [GN:PH02 96] [OR : Pyrococcus horikoshii] [SR : Pyrococcus 
horikoshii (strain:OT3) DNA] [DB : genpept-bctl] [DE : Pyrococcus horikoshii OT3 genomic 
DNA, 1-287000 nt . position (1/7).] [NT:similar to PIR:A64491 percent identity: 42.029 
in] [LE: 262066] [RE: 262584] [DI : complement] 



ORF Name 



NTID 



„ TT% NT AA 

AAID — -« — , score 

LengthLength 



probability 



E 



17503000991 4335752 tl 4 



|5825 |[T7I~| 



Description 
NO-HIT 



ORF Name 



_ _ NT AA 

NTID AAID — _ T — , , score 

LengthLength 



probability 



E 



17503000991 4546890 t2 174 



[2054 | [5825 || 150 ~| [49 | 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID AAID , 



Length Length 



score 



probability 



1AI7503000991 4687518 c2 519 



[2055 | [5827 [ 194 |[ 179 || 8.0e-14 

Description 

sp: [LN:COMK_BACSU] [AC:P40396] [GN:COMK] [OR:BACILLUS SUBTILIS] [DE : COMPETENCE 
TRANSCRIPTION FACTOR (CTF) (COMPETENCE PROTEIN K) ] [SP:P40396] [DB : swissprot] 
>pir: [LN:S43611] [AC : S43611 : A69604 ] [PN : competence transcription factor (CTF) comK] 
[GN: comK ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83044 :g2633378] [LN:BSUB0006] 
[AC:Z99109:AL009126] [PN : competence transcription factor (CTF)] [GN : comK] [FN:final 
autoregulatory control switch prior to] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[SP:P40396] [LE:117081] [RE:117659] [DI: direct] >gp : [GI :g546917] [LN:S70734] 
[AC:S70734] [GN: comK] [OR:Bacillus subtilis] [SR:Bacillus subtilis E26] 
[DB:genpept-bctl] [DE : comK [Bacillus subtilis, E26, Genomic, 3 genes, 1947 nt] .] 
[NT: This sequence comes from Fig. 3.] [LE:900] [RE: 14 78] [DI: direct] 



ORF Name 



A17b03000991_4688750_t3_2B3 

Description 

NO-HIT 



NT AA 
NTID AAID — . ^ — , scor e 
LengthLength 

[2056 | |582fl ||174 1 157 | 



probability 



478 



ORF Name 



^17503000991 4726010 cl 413 



NTID 



NT AA 
AAID — . T — t , score 
LengthLength 



probability 



] [2057 | [ 5829 |pW][7^7 1 [T7Tr|| 3 : 7e-183 



Description 

gp: [GI:gll96907] [LN : STMDRRC] [AC:L76359] [PN : daunorubicin resistance protein] 
[GN:drrC] [OR : Streptomyces peucetius] [DB : genpept-bctl] [DE : Streptomyces peucetius 

daunorubicin resistance protein (drrC) gene , complete cds.] [LE:991] [RE: 3285] 
[DIrdirect] 



ORF Name 



IA17503000991 4781275 12 194 



Description 
NO-HIT 



NTID 



NT AA 
AAID — . — , . score 
LengthLength 



probability 



[5830 |p*9~| [82 | 



ORF Name 



IAI7503000991 4812675 cl 412 



NTID 



A7VTr , NT AA 

AAID — , . _ — , . score 

LengthLength 



probability 



2059 [ [5831 ||105 [[273 ||3 . le-23 



Description 

sp: [LN:G6PD_BACSU] [AC:P54547] [GN:ZWF] [OR:BACILLUS SUBTILIS] [EC : 1 . 1 . 1 . 49] 
[DE: PROTEIN 11) (VEG11) ] [SP:P54547] [DB : swissprot ] >pir : [LN : B69964] [AC:B69964 ] 
[PN:probable glucose -6 -phosphate 1- dehydrogenase, yqjJ] [GN:yqjJ ] 

[CL:glucose-6 -phosphate dehydrogenase] [OR:Bacillus subtilis] [EC : 1 . 1 . 1 . 49] [DB:pir2] 
>gp: [GI:dl013296 :gl303961] [LN : BACJH642] [AC : D84432 : D82370] [PN:YqjJ] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis (strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 239025] 
[RE:240494] [DI : complement ] >gp: [GI : e!185654 :g2634820] [LN : BSUB0013] 

[AC:Z99116:AL009126] [GN:yqjJ] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT:similar to glucose- 6 -phosphate 1- dehydrogenase] [SP:P54547] [LE:83125] [RE:84594] 
[DI: direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



E 



17503000991 48587 13 381 



j [2060 | [5832 [[Tu^T] [353 [[729 \fT7Ze=JT 



Description 

sp : [LN : GLPQ_BACSU] [AC:P37965] [GN: GLPQ] [OR:BACILLUS SUBTILIS] [EC : 3 . 1 . 4 . 46] 
[DE: ( GLYCEROPHOS PHODI ESTER PHOSPHODIESTERASE)] [SP:P37965] [DB : swissprot] 
>pir: [LN:I40418] [AC : 140418 : E69634 : S37251 ] [PN : glycerophosphoryl diester 
phosphodiesterase glpQ] [GN:glpQ ] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:dl034076 :g3599635] [LN : AB006424 ] [AC : AB006424] [GN:ybeD] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis (strain:168) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis genomic 
DNA, 70 kb region between 17 and 23degree.] [LE: 36004] [RE: 36885] [DI : complement] 
>gp: [GI:g403373] [LN : BSGLPTQ] [AC:Z26522] [PN : glycerophosphoryl diester 
phosphodiesterase] [GN: glpQ] [FN: hydrolysis of deacylated phospholipids] [OR:Bacillus 
subtilis] [DB : genpept-bctl] [DE : B . subtilis glpT and glpQ genes for glycerol 3-phosphate 
permeaseand glycerophosphoryl diester phosphodiesterase.] [SP:P37965] [LE:1748] 
[RE:2629] [DI:direct] >gp: [GI :ell82165:g2 632499] [LN: BSUB0002] [AC : Z99105 : AL009126] 
[PN: glycerophosphoryl diester phosphodiesterase] [GN:glpQ] [FN: hydrolysis of deacylated 
phospholipids] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC : 3 . 1 . 4 . 46] [DE:Bacillus 
subtilis complete genome (section 2 of 21): from 194651 to415810.] [NT : alternate gene 
name: ybeD] [SP:P37965] [LE: 38353] [RE: 39234] [DI : complement] 



ORF Name 



^£7503000991 4882133 cl 435 



NTID 



NT AA 
AAID — — score 
Length Length 



probability 



[5833 |p^~| |49 | 



Description 
NO-HIT 



479 



NT AA 

ORFName NTID AAID Le ~ thLe ~ th ^gore probability 



AI7S0300099i_4Ba2aOfe_c2J>24 | [2062 | |5B34 ||522 1 |173 | p^S - 1|3 . 9e-37 



Description 

pir: [LN:D69838] [AC:D69838 ] [PN : conserved hypothetical protein yisU] [GNryisU ] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell83088 : g2633422] [LN:BSUB0006] 
[AC:Z99109 :AL009126] [GNryisU] [FNiunknown] [ORrBacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 6 of 21): from 999501 tol209940.] 
[NT: similar to hypothetical proteins] [LE: 165981] [RE: 166643] [DI : complement] 
>gp: [GI :ell73520 :g2145387] [LN : BSY09476] [AC:Y09476] [PN:YisU] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE : B . subtilis 54kb genomic DNA fragment.] [NT : putative] [LE:23207] 
[RE : 23869] [DI : complement] 



NT AA 

ORF Name NTID AAID — — score probability 



Length Length - 

AI7B0300095i_496i000_ti_6 | [ 2063 | [ 5335 || 123 1 140 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID — — score probability 



IAI7503000591 5085003 fl 118 



LengthLength - 
2064 | [5836 |pTC5~| |484 ||1439 ||2.4e-147 



Description 

pir: [LN:C69589] [AC:C69589 ] [PN : argininosuccinate lyase argH] [GN:argH ] 
[CL:argininosuccinate lyase] [OR:Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell84193:g2635409] [LN : BSUB0015] [AC : Z99118 : AL009126] [PN : argininosuccinate 
lyase] [GN:argH] [FN:arginine biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[EC:4.3.2.1] [DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 
3013540.] [LE:215693] [RE:217078] [DI : complement] >gp : [GI : ell85817 : g263 5428] 
[LN:BSUB0016] [AC : Z99119 : AL009126] [PN : argininosuccinate lyase] [GN:argH] [FN:arginine 
biosynthesis] [OR:Bacillus subtilis] [DB : genpept-bctl] [EC:4.3.2.1] [DE:Bacillus 
subtilis complete genome (section 16 of 21): from 2997771to 3213410.] [LE:13053] 
[RE:14438] [DI : complement] >gp : [GI : g2293243] [LN: AF008220] [AC : AF008220] [PN:arginine 
succinate lyase] [GN:argH] [OR:Bacillus subtilis] [DB : genpept - bet 2 ] [DE:Bacillus 
subtilis rrnB-dnaB genomic region.] [LE: 165989] [RE: 167374] [DI:direct] 



ORF Name 


NT AA 

NTID AAID L — — th score probability 


AI7503000991_51i5927_c2_528 


| 2055 | |5837 ||1S80||559 ||340 ||1.3e-59 



Description 

pir : [LN:S76520] [AC:S76520 ] [PN : hypothetical protein] [OR : Synechocystis sp . ] [SR:PCC 
6803, , PCC 6803] [SR:PCC6803, ] [DB:pir2] >gp : [GI : dl011017 : gl001635] [LN : SYCSLRD] 
[AC:D64002 :AB001339] [PN: 2 -succinyl -6-hydroxy-2 , ] [GN:menD] [OR : Synechocystis sp.] 
[SR: Synechocystis sp. (strain : PCC6803) DNA] [DB : genpept-bctl] [DE : Synechocystis sp. 
PCC6803 complete genome, 21/27, 2644795-2755702.] [NT : ORF_ID : si 10 6 03] [LE:25321] 
[RE: 27108] [DI : complement ] 

NT AA 

ORF Name NTID AAID — «^ — , score probab ility 

LengthLength £ - 



|&17503000951_5i657ii_t3_336 | [2066 | [5535 ||?rr~] | 148 | . ie-30 



Description 

pir: [LN:A69844] [AC:A69844 ] [PN : hypothetical protein yjbl] [GN:yjbI ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell83176:g2 633510] [LN : BSUB0007] [AC : Z99110 : AL009126] 
[GN:yjbI] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 7 of 21) : from 1194391to 1411140.] [LE: 39591] [RE: 39989] 
[DI : complement] 



480 



NT AA 

ORF Name NTID AAID Le ~ thLe ^ t fr score probability 



[Al7b0300099i_>2b02b8_cl_403 | [2057 | [ 5839 |p?Tj [ 512 | [TTST p . 4e-179 

Description 

gp: [GI:dl014255:gl651216] [LNrD88209] [AC:D88209] [PN : Pz-peptidase] [ORrBacillus 
licheniformis] [SRrBacillus lichenif ormis (strain:N22) DNA] [DB rgenpept-bctl] 
[DE:Bacillus licheniformis DNA for Pz-peptidase, complete cds . ] [LE:238] [RE:2124] 
[DI: direct] 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 



A1750300099i_57944i_ci_385 | [2058 | [5840 ||r5u~| p" 



Description 
NO-HIT 



ORF Name 



NTID 



„ TT% NT AA 

AAID — _ — , , score 

Length-Length 



probability 



E 



.17503000991 580307 r2 245 



Description 
NO-HIT 



2059 | [5841 |pTT2 j [3T 



ORF Name 



NTID 



NT AA 
AAID — — , , score 
LengthLength 



IAI7503000991 6057943 c2 517 



] [2070 | [ 5842 | |999 "] [ 332 | piT 



probability 
| |9.2e-100 



Description 

pir: [LN:G69830] [AC:G69830 ] [PN : lipoate-protein ligase homolog yhfJ] [GN:yhfJ ] 
[CL:lipoate-protein ligase] [ORrBacillus subtilis] [DB:pir2] >gp : [GI : ell83027 : g2633361] 
[LN:BSUB0006] [AC r Z99109 r AL009126] [GNryhfJ] [FNrunknown] [ORrBacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 6 of 21) : from 999501 
tol209940.] [NT:similar to lipoate-protein ligase] [LE:99131] [RE:100126] [DIrdirect] 
>gp: [GI:e325016:g2226243] [LN: BSY14083] [AC:Y14083] [PN : hypothetical protein] [GN:yhfJ] 
[ORrBacillus subtilis] [DB rgenpept-bctl] [DErBacillus subtilis chromosomal DNA, region 
76-78 degreesr betweenglyB-aprE . ] [NT r Similarity to several lipoate-protein ligases] 
[LEr 18990] [RE r 19985] [DIrdirect] 



ORF Name 



NTID AAID 



NT 



AA 



Length Length 



score probability 



[AI7503000991 5588757 t2 123 



] [2071 | [5843 [pn— 1 |158 | pi~ ]| 4 . 6e-82 



Description 

gpr [GIrg2267240] [LN r SEU71377] [ACrU71377] [OR : Staphylococcus epidermidis] 
[DB rgenpept-bctl] [DE : Staphylococcus epidermidis autolysin AtlE and 

putativetranscriptional regulator AtlR genes, complete cds.] [NT r ORF2 ] [LEr913] 
[RErl383] [DIrdirect] 



481 



NT AA 

ORF Name NT ID AAID Le ~ thLen "^ t h score probability 



A17b03000991J>921877_cl_410 | [ 20 7 2 | [5844 |pX9 — | [72 | [13? 



|2.3e-14 



Description 

sp: [LN:G6PD_BACSU] [AC:P54547] [GN:ZWF] [OR:BACILLUS SUBTILIS] [EC : 1 . 1 . 1 .49] 
[DE: PROTEIN 11) (VEG11) ] [SP:P54547] [DB : swissprot] >pir : [LN : B69964] [AC:B69964 ] 
[PN:probable glucose -6 -phosphate 1- dehydrogenase, yqjJ] [GNiyqjJ ] 

[CL: glucose -6 -phosphate dehydrogenase] [OR:Bacillus subtilis] [EC : 1 . 1 . 1 . 49] [DB:pir2] 
>gp: [GI:dl013296:gl303961] [LN : BACJH642 ] [AC :D8 4432 :D8 2370] [PN:YqjJ] [ORrBacillus 
subtilis] [SR:Bacillus subtilis (strain : JH642 (trpC2 PheAl) ) DNA] [DB :genpept-bctl] 
[DE:Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE:239025] 
[RE:240494] [DI : complement] >gp : [GI : ell85654 : g2634820] [LN : BSUB0013] 

[AC:Z99116:AL009126] [GN:yqjJ] [FN:unknown] [ORrBacillus subtilis] [DB :genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT: similar to glucose- 6 -phosphate 1- dehydrogenase] [SP:P54547] [LE: 83125] [RE: 84594] 
[DI : direct] 



NT AA 

ORF Name NTID AAID — . — , score probability 



Length Length - 

AI750300033i_7032752_t3_268 | [2073 | [5845 | | 75S ~] [ 254 | ^JT\\ 2 . 5e-136 



Description 

gp: [GI:g2267239] [LN: SEU71377] [AC:U71377] [OR : Staphylococcus epidermidis] 
[DB:genpept-bctl] [DE : Staphylococcus epidermidis autolysin AtlE and 

putativetranscriptional regulator AtlR genes, complete cds.] [NT : ORF1] [LE:<1] [RE: 865] 
[DI : direct] 

NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

[AI7503000991_70817i2_tl_27 | [2074 [ [5845 |p^] [ 77 |p | |0. 00093 "| 

Description 

pir: [LN:H69835] [AC:H69835 ] [PN : hypothetical protein yhzC] [GN:yhzC ] [OR:Bacillus 
subtilis] [DB:pir2] >gp: [GI :ell83043 :g2633377] [LN: BSUB0006] [AC : Z99109 : AL009126] 

[GN:yhzC] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept -bet 1 ] [DE:Bacillus subtilis 
complete genome (section 6 of 21): from 999501 tol209940.] [LE:116555] [RE:116788] 

[DI : complement] 



_ NT AA 
ORF Name NTID AAID — , — , , score probability 
LengthLength ^ JL 



|A171>0300099I 7145250 F3 337 



[2075 | [5847 ||243 ~[|80 | 



Description 
NO-HIT 



482 



NT AA 

ORF Name NTID AAID Le ^ t frLength SCQre probability 



AI750300099i_7228438_c2_471 | [2075 | [ 5848 | pT7T] [1076 [ [ 1793 | | 7 . 4e-I8b | 



Description 

spr [LNrADDA_BACSU] [AC:P23478] [GN : ADDA] [OR : BACILLUS SUBTILIS] [DE : ATP -DEPENDENT 
NUCLEASE SUBUNIT A] [SP:P23478] [DB : swissprot] >pir : [LN : B39432 ] [AC : B3 9432 : H69582 ] 
[PN: ATP -dependent deoxyribonuclease chain A addA: ATP -dependent exonuclease synthesis 
protein AddA] [GN:addA ] [ORrBacillus subtilis] [DB:pir2] >gp : [GI :gl42440] 
[LN : BACADDAA] [AC:M63489] [PN : ATP -dependent nuclease] [GNraddB] [ORrBacillus subtilis] 
[SR:B. subtilis DNA] [DB : genpept-bctl] [DE: Bacillus subtilis ATP-dependent nuclease 
(addA) and (addB) , andopen reading frame 3, partial cds . ] [LE:3988] [RE:7687] 
[DIrdirect] >gp : [GI :ell83065:g2 633399] [LN : BSUB0006] [AC : Z99109 : AL009126] 
[PN: ATP-dependent deoxyribonuclease (subunit A)] [GNraddA] [ORrBacillus subtilis] 
[DB : genpept-bctl] [DErBacillus subtilis complete genome (section 6 of 21) : from 999501 
tol209940.] [NTralternate gene name: recE5] [SP:P23478] [LE:139780] [RE:143478] 
[DIrdirect] >gp : [GI : el364881 : g2226192] [LN : BSY14081] [AC:Y14081] [PN : ATP-dependent 
nuclease] [GNraddA] [ORrBacillus subtilis] [DB rgenpept-bctl] [DErBacillus subtilis 
chromosomal DNA, region 92 degrees: regionbetween comK and addAB.] [NT r see ref [3]; In 
EMBL entry M63489 this gene is] [LE:22075] [RE:25773] [DIrdirect] 



ORF Name 



NT AA 
NTID AAID — _ — , , score 
LengthLength 



probability 



^7503000991 7241300 c2 502 



"| [2077 | [5849 |p5~ ||44 | 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
AAID — _ — . . score 
LengthLength 



probability 



UE7S03000991 7308375 c3 557 



] [2078 | [5850 |p^"| |313 ||791 ||l.le-78 



I 

Description 

pir: [LNrH69668] [AC : H69668 : S15231 r B3844 7 ] [PN : oligopeptide transport system permease 
protein oppB r sporulation initiation protein spoOKB] [GNr oppB r spoOKB ] [CL r oligopeptide 
permease protein oppB] [ORrBacillus subtilis] [DBrpir2] >gp r [GI r ell83164 rg2633498] 
[LNrBSUB0007] [AC r Z99110 r AL009126] [PN r oligopeptide ABC transporter (permease)] 
[GN r oppB] [FNrrequired for initiation of sporulation,] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 7 of 21): from 
1194391to 1411140.] [NTralternate gene name: spoOKB] [LE:26676] [RE:27611] [DIrdirect] 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



IAI7503000991 782816 cl 396 



|585i |[T3Tr||6i8 | [775~||3 ■ 7e-116 



Description 

pir: [LNrC6 9975] [AC:C69975 ] [PN r acyltransf erase homolog yrhL] [GNryrhL ] [ORrBacillus 
subtilis] [DB:pir2] >gp r [GI rgl934616] [LNr BSU93874] [AC:U93874] [PN r hypothetical 
protein YrhL] [GNryrhL] [ORrBacillus subtilis] [DB rgenpept-bctl] [DErBacillus subtilis 
cysteine synthase (yrhA) , cystathioninegamma- lyase (yrhB) , YrhC (yrhC) , YrhD (yrhD) , 
formate dehydrogenasechain A (yrhE) , YrhF (yrhF) , formate dehydrogenase (yrhG) , 
YrhH(yrhH), regulatory protein (yrhl) , cytochrome P450 102 (yrhJ) , YrhK(yrhK), 
hypothetical protein YrhL (yrhL), putative anti-SigV factor (yrhM) , RNA polymerase sigma 
factor SigV (sigV) and YrhO (yrhO) genes, complete cds, and YrhP (yrhP) gene, partial 
cds.] [NT: similar to Haemophilus influenzae hypothetical] [LEr 13904] [RE r 15808] 
[DIr complement] >gpr [GI : ell83944 rg2635160] [LNr BSUB0014] [AC r Z99117 : AL009126] [GNryrhL] 
[FN r unknown] [ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis complete 
genome (section 14 of 21) r from 2599451to 2812870.] [NT r similar to acyltransf erase] 
[LEr 171138] [RE r 173042] [DI rdirect] 



483 



ORF Name 



NT ID 



AAID' 



NT 



AA 



LengthLength 



score probability 



IAI7503000991 813412 cl 411 



] [ 2090 | [5852 ||156 1 151 | pi~||3 . 0e-14 



Description 

sp: [LN:G6PD_BACSU] [AC:P54547] [GN:ZWF] [OR : BACILLUS SUBTILIS] [EC : 1 . 1 . 1 . 49] 
[DE: PROTEIN 11) (VEG11) ] [SP:P54547] [DB : swissprot] >pir : [LN : B69964] [AC:B69964 ] 
[PN: probable glucose- 6 -phosphate 1- dehydrogenase, yqjJ] [GNryqjJ ] 

[CL: glucose- 6 -phosphate dehydrogenase] [OR:Bacillus subtilis] [EC : 1 . 1 . 1 . 49] [DB:pir2] 
>gp: [GI:dl013296:gl303961] [LN : BACJH642] [AC : D84432 : D823 70] [PNrYqjJ] [ORrBacillus 
subtilis] [SR:Bacillus subtilis (strain: JH642 (trpC2 PheAl) ) DNA] [DB : genpept-bctl] 
[DE: Bacillus subtilis DNA, 283 Kb region containing skin element.] [LE: 239025] 
[RE:240494] [DI : complement] >gp: [GI : ell85654 :g2634820] [LN : BSUB0013] 

[AC:Z99116:AL009126] [GN:yqjJ] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 13 of 21): from 2395261to 2613730.] 
[NT: similar to glucose -6 -phosphate 1- dehydrogenase] [SP:P54547] [LE: 83125] [RE: 84594] 
[DI: direct] 



ORF Name 



IA17503000991 859838 13 357 



Description 
NO-HIT 



NTID 



A7VTrx NT AA 

AAID — . _ — , . score 

LengthLength 



probability 



|5853 | ^T, 



ORF Name 



^17503000991 978436 t2 126 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



[2082 | [5854 ||il88 | |395 ||grr~||9 ■ 9e-87 



Description 

sp: [LN:PATA_BACSU] [AC:P16524] [GN : PATA : UAT] [OR:BACILLUS SUBTILIS] [EC:2.6.1.-] 
[ DE : PUT AT I VE AMINOTRANSFERASE A,] [SP:P16524] [DB : swissprot] >gp : [GI : ell81905 : g2632221] 
[LN:BS16829KB] [AC : AJ222 587] [PN:PatA protein] [GN :patA] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE:Bacillus subtilis 29kB DNA fragment from ykwC gene to csel5 
gene.] [NT:hisC homologue] [SP:P16524] [LE:6213] [RE:7391] [DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID 

LengthLength 



score 



IAI7503000991 985887 c3 570 



2083 | [5855 |p3~| |144 | pTTT 



probability 
|2.9e-55 



Description 

pir: [LN:D69843] [AC:D69843 ] [PN : conserved hypothetical protein yjbD] [GN:yjbD ] 
[CL: hypothetical protein yjbD] [OR: Bacillus subtilis] [DB:pir2] 

>gp: [GI:ell83170:g2633504] [LN : BSUB0007] [AC : Z99110 : AL009126] [GN:yjbD] [FN:unknown] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis complete genome (section 
7 of 21): from 1194391to 1411140.] [NT:similar to hypothetical proteins] [LE:32779] 
[RE : 33174] [DI : direct] 



ORF Name 



IAI7503000991 9927077 13 378 



Description 
NO-HIT 



NTID 



NT AA 
AAID — , _ — , . score 
LengthLength 



probability 



[2084 | [5856 ||i47""]|48 | 



ORF Name 



IAI7503000992 10008513 c3 117 



NT 



AA 



NTID AAID — — , . score probability 
LengthLength c 



[5857 ||2667||888 |p^T]pTTT 



Description 

pir : [LN:SYBSVS] [AC:A26738 ] [PN : valine- -tRNA ligase, :valyl-tRNA synthetase] [GN:valS ] 
[CL: valine --tRNA ligase] [OR:Bacillus stearothermophilus] [EC:6.1.1.9] [DB:pirl] 



484 



ORF Name 



NTID 



NT 



AA 



AAID , , , 



LengthLength 



score 



probability 
2086 | [5858 |[Q7r[|442 ||I33T]| 3 -6e-137 



|A17503000992_10720877_cl_78 

Description 

pir : [LN:F69723] [AC:F69723 ] [PN:trigger factor (prolyl isomerase) tig] [GN:tig ] 
[CL:trigger factor] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell84072 :g2635288] 
[LN:BSUB0015] [AC : Z99118 : AL009126] [PN:trigger factor (prolyl isomerase)] [GN:tig] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE: Bacillus subtilis complete genome (section 

15 of 21): from 2795131to 3013540.] [NT : alternate gene name: yzzH] [SP:P80698] 
[LE:90286] [RE:91560] [DI : complement] >gp: [GI : ell65373 :gl770074] [LN: BSZ75208] 
[AC:Z75208] [PN:trigger factor] [GN:tig] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis genomic sequence 89009bp.] [NT: homology to trigger factor of 

Haemophilus] [SP:P80698] [LE: 78347] [RE: 79621] [DI: direct] 



ORF Name 



NT AA 
NTID AAID — , T — . score 
LengthLength 



probability 



1A17503000992 11954500 t3 71 



][ 



Description 
NO-HIT 



[5859 j|T72 | [ST 



ORF Name 



NT AA 
NTID AAID — — , score 
LengthLength 



probability 



IAI7503000992 1211562 c2 94 



2083 | [5860 |[126J]|422 | pTF]|l . Oe-155 



Description 

gp: [GI:e221213:gl296452] [LN : BSCLPXGEN] [AC:X95306] [PN:ClpX protein] [GN:clpX] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE :B . subtilis clpX gene.] [NT : ATP -dependent 
protease] [SP:P50866] [LE:168] [RE:1430] [DI: direct] 



ORF Name 



AI7503000992 14725300 c3 108 



NTID 



NT AA 

AAID , — _ T — ^, score probability 
Length Length J - 



[5861 



Description 
NO-HIT 



ORF Name 



IAI7503000992 15820252 cl 81 



Description 
NO-HIT 



ORF Name 



IA17503000992 16835900 c2 104 



NT AA 
NTID AAID — _ — _ score 
LengthLength 



NTID 



2091 



probability 



j [2090 | [5852 ||141 | [46 [ 



_ _ NT AA 

AAID — — score 

LengthLength 



ITS" 



73" 



probability 
10.011 



Description 

gp: [GI:gll23053] [LN : CELF59A6] [AC:U41994] [GN:F59A6.2] [OR : Caenorhabditis elegans] 
[SR:Caenorhabditis elegans strain=Bristol N2] [DB : genpept - invl ] [DE : Caenorhabditis 
elegans cosmid F59A6.] [LE : 19719 : 2 0072] [RE : 19904 : 20242] [DI : direct Join] 



485 



ORF Name 



NT AA 
NTID AAID — _ „ — , , score 
LengthLength 



probability 



AI7503000992 188388 C3 122 



J [2092 | [5864 |p73~||192 "|[220 ||3.5e-18 



Description 

Sp : [LN : MREC_BACSU] [AC:Q01466] [GN-.MREC] [OR:BACILLUS SUBTILIS] [DE : ROD 

SHAPE -DETERMINING PROTEIN MREC] [SP:Q01466] [DB : swissprot] >pir : [LN : JC4595] [AC:JC4595 
] [PNrcell shape determinant MreC : protease secretion stimulating protein, MreC] 
[GN:mreC ] [OR: Bacillus stearothermophilus] [DB:pir2] >pir: [LN:C45240] 
[AC:C45240:D45239:S27518 ] [PN:cell shape determinant mreC] [GN:mreC ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI :gl43213] [LN: BACMREMIN] [AC:M95582] [GN:mreC] [FN:cell 
shape determining] [ORrBacillus subtilis] [SR:Bacillus subtilis (strain W168) DNA] 
[DB:genpept-bctl] [DE: Bacillus subtilis (mreB) gene complete cds, (mreC) gene 
completecds, (mreD) gene complete cds, (minC) gene complete cds, (minD)gene complete 
cds.] [NT:putative] [LE:1370] [RE:2242] [DI: direct] 



ORF Name 



NTID AAID 



NT 



AA 



Length Length 



score probability 



IAI7503000992 19707767 ti 4 



Description 
NO-HIT 



[5865 |pmr"||i07 | 



ORF Name 



NT AA 
NTID AAID — , — , score 
Length Length 



probability 



IAI7503000992 20348160 ti 1 



[5866 ||73 ||71 ||0 . 022 



Description 

gp: [GI:gll31474] [LN : PBU42580] [AC : U4 2580: Ul 7055: U3 2570] [GN:A130R] [OR : Paramecium 
bursaria Chlorella virus 1] [DB :genpept-vrl] [DE : Paramecium bursaria Chlorella virus 1, 
complete genome.] [LE: 69061] [RE: 69378] [Dlidirect] 



ORF Name 



NTID 



NT AA 
AAID — — , , score 
LengthLength 



probability 



IAI7503000992 21500253 c3 109 



Description 
NO-HIT 

ORF Name 



2095 | [5867 ||m~~|p | 



IAI7503000992 21563137 c2 102 



NTID 
] 12096 



„ TO NT AA 

AAID — — , , score 

LengthLength 



probability 



[5868 ||135 [ |5T" 



Description 
NO-HIT 



ORF Name 



NTID 



_ _ -yj— ^ NT AA 

AAID — — score 

LengthLength 



probability 



AI7S03000992_23444187_t3_62 

Description 

NO-HIT 



j [2097 | [5869 |53 | 



486 



ORF Name 



NTID 



AAID 



NT 



AA 



Length Length 



score probability 



|Ai7t>030009S2 23b22b 57 c2 9i 



2098 | [ 58 7 0 |pT~] |126 | p>0 | | 1.5e-4 T 



] 



Description 

sp: [LN:RL20_BACSU] [AC:P55873] [GN : RPLT] [OR : BACILLUS SUBTILIS] [DE:50S RIBOSOMAL 
PROTEIN L2 0] [SP:P55873] [DB : swissprot] >pir: [LN:F69696] [AC:F69696 ] [PN : ribosomal 
protein L2 0 rplT] [GN : rplT ] [CL : Escherichia coli ribosomal protein L2 0] [OR: Bacillus 
subtilis] [DB:pir2] >gp: [GI : ell84134 :g2635350] [LN:BSUB0015] [AC : Z 99118 : AL009126] 
[PN: ribosomal protein L20] [GN : rplT] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 15 of 21): from 2795131to 3013540.] 
[SP:P55873] [LE:156202] [RE:156561] [DI : complement] >gp : [GI : el 16 5301 :gl 770009] 
[LN:BSZ75208] [AC:Z75208] [PN : ribosomal protein L20] [GN : rplT] [OR:Bacillus subtilis] 
[DB : genpept-bctl] [DE :B . subtilis genomic sequence 89009bp.] [NT: homology to rplT of 
Bacillus stearothermphilus;] [SP:P55873] [LE: 13346] [RE: 13705] [DI: direct] 



ORF Name 



NTID 



NT AA 
AAID — , ■] _ — . . score 
LengthLength 



probability 



IA17503000992 23531528 Tl 28 



2uW 



|587l j|T¥3 | 



Description 
NO-HIT 



ORF Name 



IAI7503000992 23697141 c2 97 



NT 



AA 



NTID AAID — . r — , , score probabilit y 
Length Length t - 

] |2100 | [5872 |pu~|p09 | pTT||i . 8e-135 



Description 

gp: [GI:g2589181] [LN:SAU89396] [AC:U89396] [PN: porphobilinogen deaminase] [GN:hemC] 
[OR staphylococcus aureus] [DB :genpept-bct2] [EC: 4. 3. 1.8] [DE : Staphylococcus aureus 
hemCDBL gene cluster: porphobilinogendeaminase (hemC) , uroporphyrinogen III synthase 
(hemD) , d- aminolevulinic acid dehydratase (hemB) and GSA-l-aminotransf erase (hemL) genes, 
complete cds . ] [LE:219] [RE: 1145] [DI:direct] 



ORF Name 



NT AA 
NTID AAID — — _ score 
L en gthLength 



probability 



AI7503000992 24017052 C2 103 



TTUT 



Description 
NO-HIT 



|5873 |pr5"D [ |49 j 



ORF Name 



NTID 



AAID _ — . 



IAI7503000992 24240877 c3 119 



][ 



Description 
NO-HIT 



NT AA 
LengthLength 
[5874 ||522~[p73" 



score 



probability 



ORF Name 



NTID 



AAID 



IAI7503000992 24625151 cl 87 



TTUT 



5875 Ifl^T 



NT AA 
LengthLength 
ffS 1 



score probability 
1160 ||8.2e-12 



Description 

sp: [LN:RADC_STAAU] [AC:P31337] [GN: RADC] [OR : STAPHYLOCOCCUS AUREUS] [DE:DNA REPAIR 
PROTEIN RADC HOMOLOG (25 KD PROTEIN) (FRAGMENT)] [SP:P31337] [DB : swissprot] 
>gp: [GI:g551992] [LN : STATNIS5] [AC:K02985] [OR : Staphylococcus aureus] [SR:S. aureus 
(strain RN450) DNA] [DB : genpept-bctl] [DE:S. aureus (strain RN450) transposon Tn554 
insertion site.] [NT:25 kD protein (putative); putative] [LE:<1] [RE:>249] [DI:direct] 



487 



ORF Name 



NTID 



NT AA 
AAID — , — .score 
Length Length 



probability 



[A17503000992 24^973 12 c3 112 



][ 



[ 58 7 6 [[ 186 — 1|<51 |p75~|| 7.1e-22 



Description 

sp: [LN:HEM2_STAAU] [AC:P50915] [GN : HEMB] [OR : STAPHYLOCOCCUS AUREUS] [EC : 4 . 2 . 1 . 24] 
[DE : SYNTHASE) (ALAD) (ALADH) ] [SP:P50915] [DB : swissprot] >gp : [GI :g2589183] 
[LN:SAU89396] [AC:U89396] [PN : d- aminolevulinic acid dehydratase] [GN:hemB] 
[OR: Staphylococcus aureus] [DB : genpept -bct2 ] [EC : 4 . 2 . 1 . 24] [DE : Staphylococcus aureus 
hemCDBL gene cluster: porphobilinogendeaminase (hemC) , uroporphyrinogen III synthase 
(hemD) , d-aminolevulinic acid dehydratase (hemB) and GSA- 1-aminotransf erase (hemL) genes, 
complete cds . ] [LE:1838] [RE:2809] [DI:direct] 



ORF Name 



NTID 



IAI7503000992 25582512 t2 23 



TTUT 



_ _ _ _ NT AA 

AAID — _ — , score 

LengthLength 

[5877 |jT23 1 140 [ 



probability 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score 



IA17503000992 25596000 c3 111 



probability 
[58 7 8 |p5~~|| 274 || 460 | [T77e -43 



Description 

sp: [LN:HEMX_BACSU] [AC:P16645] [GN : HEMX] [OR:BACILLUS SUBTILIS] [DE : HEMX PROTEIN] 
[SP:P16645] [DB : swissprot] >pir : [LN:B35252] [AC : B35252 : C69640 ] [PN:hemA concentration 

negative effector hemX] [GNrhemX ] [OR:Bacillus subtilis] [DB:pir2] >gp: [GI:gl43036] 
[LN : BACHEMAXC] [AC : M57676 : M32130] [GN:hemX] [OR:Bacillus subtilis] [SR : B . subtilis DNA] 
[DB:genpept-bctl] [DE:Bacillus subtilis hemAXCDBL gene cluster.] [NT unidentified gene 

product] [LE:1607] [RE:2437] [DIrdirect] >gp : [GI : ell84065 : g2635281] [LN : BSUB0015] 
[AC:Z99118 :AL009126] [PN : membrane -bound protein] [GN:hemX] [FN:negative effector of the 

concentration of HemA] [OR:Bacillus subtilis] [DB : genpept -bctl] [DE:Bacillus subtilis 

complete genome (section 15 of 21): from 2795131to 3013540.] [SP:P16645] [LE:80898] 
[RE:81728] [DI : complement] >gp: [GI :ell65382:gl 770082] [LN:BSZ75208] [AC:Z75208] 
[PN: membrane -bound protein] [GN:hemX] [OR:Bacillus subtilis] [DB : genpept -bctl] 
[DE:B. subtilis genomic sequence 89009bp.] [NT:membrane bound protein dispensable for 

heme] [SP:P16645] [LE:88179] [RE:89009] [DI:direct] 



ORF Name 



NT 



AA 



NTID AAID — — , , score probabili ty 
LengthLength c 



IAI7503000992 2866255 t3 65 



[5879 |p^T~||87 ||74 ||0 , 045 



Description 

gp: [GI:el250026:g2842472] [LN : SPBC2 0F10] [AC : AL021747] [PN : preg- like protein.] 
[GN:SPBC20F10 . 10] [OR : Schizosaccharomyces pombe] [SR: fission yeast] [DB : genpept -plnl] 
[DE:S.pombe chromosome II cosmid C20F10.] [NT : SPBC20F10 . 10 , len:243, 
SIMILARITY :Arabidopsis] [LE: 20474] [RE: 212 05] [DI: direct] 



ORF Name 



NTID 



AAID 



IA17503000992 31510 ±3 55 



TTUW 



"5OTTT 



NT AA 
LengthLength 



score 



TUT 



probability 
S.0e-4i 



Description 

Sp : [LN : 3MGA_HAEIN] [AC:P44321] [GN: TAG : HI 06 54] [OR : HAEMOPHILUS INFLUENZAE] 
[EC:3.2.2.20] [DE : GLYCOSYLASE) (TAG)] [SP:P44321] [DB : swissprot] >pir : [LN : G64084] 
[AC:G64084 ] [PN : 3 -methyladenine DNA glycosylase, I] [CL : 3 -methyladenine DNA 

glycosylase I] [OR : Haemophilus influenzae] [EC:3.2.2.-] [DB:pir2] >gp : [GI :gl573653] 
[LN:U32748] [AC : U32 748 : L42 023 ] [PN : DNA- 3 -methyladenine glycosidase I (tagl)] 
[GN:HI0654] [OR : Haemophilus influenzae Rd] [DB : genpept -bct2] [DE Haemophilus influenzae 

Rd section 63 of 163 of the complete genome.] [NT: similar to SP:P05100 GB:J02606 

GB:X03845 PID:147920] [LE:6773] [RE:7330] [DI : complement ] 



488 



NT AA 

ORFName NTID AAID Le ~ thLe ~ th probability 



[A17b03000992_3339b0b0_c2_9b | {2109 | [ Sflfli | |S18 | [ 205 | p7~ [| 5 . 3e-5 ^" 



Description 

sp: [LN:YSXC_BACSU] [AC:P38424] [GN:YSXC] [OR:BACILLUS SUBTILIS] [DE:(ORFX)] [SP:P38424] 
[DB:swissprot] >pir : [LN: 140422] [AC : 140422 : C69987 : S45102 ] [PN : conserved hypothetical 
protein ysxC] [GNrysxC ] [CL: Bacillus subtilis conserved hypothetical protein 
ysxC : translation elongation factor Tu homology] [ORrBacillus subtilis] [DB:pir2] 
>gp: [GI :g496558] [LN : BSLONLA] [ACrX76424] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:B. subtilis Ion gene for protease La.] [NT:orfX] [SP:P38424] [LEr2669] [RE:3256] 
[DIrdirect] >gp: [GI :ell84068:g2 635284] [LN: BSUB0015] [AC : Z99118 : AL009126] [GNrysxC] 
[FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis complete 
genome (section 15 of 21): from 2795131to 3013540.] [NT:similar to hypothetical 
proteins] [SP:P38424] [LE:83852] [RE:84439] [DI : complement] >gp: [GI : ell65379 : gl770079] 
[LN:BSZ75208] [AC:Z75208] [PN : hypothetical protein] [GN:ysxC] [OR: Bacillus subtilis] 
[DB:genpept-bctl] [DE : B . subtilis genomic sequence 89009bp.] [NT:unknown function; 
putative] [SP:P38424] [LE:85468] [RE:86055] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — , , score probability 
LengthLength c L - 



|AI7S03000992_33S17_ci_82 | [2110 | [5882 |pTT| | 3S8 | | 25fl |p~5e -2i | 

Description 

pir: [LN:E69834] [AC:E69834 ] [PN : hypothetical protein yhjN] [GN:yhjN ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GIrell83059rg2 633393] [LNrBSUB0006] [AC r Z99109 : AL009126] 

[GNryhjN] [FN r unknown] [ORrBacillus subtilis] [DB r genpept-bctl] [DEr Bacillus subtilis 
complete genome (section 6 of 21) r from 999501 tol209940.] [LErl30891] [RErl32045] 

[DI: complement] >gp r [GI re324987rg2 226186] [LN r BSY14081] [ACrY14081] [PN r hypothetical 
protein] [GNryhjN] [ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis 
chromosomal DNA, region 92 degrees r regionbetween comK and addAB.] [NTrTTG start; 
Similarity to a hypothetical protein] [LEr 13186] [RE r 14340] [DI : complement] 

NT AA 

ORF Name NTID AAID — — . , score probability 
LengthLength J ~ 



|Al7b03000992_34642092_cI_85 ] [2111 | [5883 |p5T]| 427 | pQ7 ||b.7e-91 



Description 

sp r [LN r FOLC_BACSU] [ACrQ05865] [GNrFOLC] [OR r BACILLUS SUBTILIS] [EC r 6 . 3 . 2 . 17] 
[DE r SYNTHETASE) (FPGS) ] [SPrQ05865] [DB r swissprot] >pir : [LN r B40646] 
[ACrB40646 r B33490 r D6 9626 ] [PN : f olyl -polyglutamate synthetase folC] [GNrfolC ] 
[ORrBacillus subtilis] [DBrpir2] >gp r [GI rgl42936] [LN r BACFOLC] [ACrL04520] 
[PNr f olyl -polyglutamate synthetase] [GNrfolC] [ORrBacillus subtilis] [SRrBacillus 
subtilis (substrain PY79, strain W168) DNA] [DB rgenpept-bctl] [DErBacillus subtilis 
valyl tRNA synthetase (valS) gene, 3* end cds ; f olyl -polyglutamate synthetase (folC) 
gene, complete cds; comCgene, 5 1 end cds.] [LEr231] [RErl523] [DIrdirect] 
>gpr [GIrell84057rg2635273] [LN r BSUB0015] [ACrZ99118 rAL009126] [PN r f olyl -polyglutamate 
synthetase] [GNrfolC] [FNr folate biosynthesis] [ORrBacillus subtilis] [DB rgenpept-bctl] 
[EC:6.3.2.17] [DErBacillus subtilis complete genome (section 15 of 21) r from 2795131to 
3013540.] [SPrQ05865] [LE: 69443] [RE: 70735] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ — ^ score probability 
LengthLength c L - 



|AI7503000992_3914012_c2_98 | [2112 | | b884 |[^u~[ |229 | | 710 |p?e -70 | 

Description 

gp: [GI:g2589182] [LN : SAU89396] [AC:U89396] [PN : uroporphyrinogen III synthase] [GN:hemD] 
[OR staphylococcus aureus] [DB : genpept -bct2 ] [EC : 4 . 2 . 1 . 75] [DE : Staphylococcus aureus 

hemCDBL gene cluster: porphobilinogendeaminase (heme) , uroporphyrinogen III synthase 
(hemD) , d-aminolevulinic acid dehydratase (hemB) and GSA-l-aminotransf erase (hemL) genes, 

complete cds.] [LE:1167] [RE: 1835] [DI:direct] 



489 



NT AA 

ORF Name NT ID AAID Le ~ thLe ^ t fr score probability 

|Al75030009S^40a044J_cJ_ili | |21i3 | [ 5885 | |129< T| | 431 | pMT]| 3 . 9e-211 "| 

Description 

sp: [LN:GSA_STAAU] [AC:O34092] [GN : HEML] [OR : STAPHYLOCOCCUS AUREUS] [EC:5.4.3.8] 
[DE: (GLUTAMATE-1- SEMI ALDEHYDE AMINOTRANSFERASE) (GSA-AT) ] [SP:O34092] [DB : swissprot] 
>gp: [GI:g2589184] [LN : SAU89396] [AC:U89396] [PN : GSA- 1 -aminotransferase] [GNihemL] 
[OR: Staphylococcus aureus] [DB :genpept-bct2] [EC: 5. 4. 3. 8] [DE : Staphylococcus aureus 
hemCDBL gene cluster: porphobilinogendeaminase (hemC) , uroporphyrinogen III synthase 
(hemD) , d-aminolevulinic acid dehydratase (hemB) and GSA- 1 -aminotransferase (hemL) genes, 
complete cds . ] [LE:2857] [RE: 4143] [DI: direct] 

NT AA 

ORF Name NT ID AAID — _ , — _ s core probability 
LengthLength c l ~ 



p^I75030005S2_4351fl3_tl_i4 | |2ii4 | [5886 |p?7~| |38 ||208 ^TZe^TT 



Description 

gp: [GI:ell65375:gl770076] [LN:BSZ75208] [AC:Z75208] [PN : hypothetical protein] [GN:ysoC] 
[OR:Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis genomic sequence 89009bp.] 
[NT:unknown function; putative] [LE:80592] [RE:81206] [DI : complement] 

NT AA 

ORF Name NT ID AAID Le ~ thLe ~ th score probability 



Al7b03000992_4729837_c2_92 [ [2115 | [ 5887 |p^"| | 207 | |415 | | 6.1e-39 



Description 

sp: [LN:YMAB_BACSU] [AC:P50619] [GN: YMAB] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 23.4 

KD PROTEIN IN NRDF-CWLC INTERGENIC REGION] [SP:P50619] [DB : swissprot] >pir : [LN : D69883 ] 
[AC:D69883 ] [PN : hypothetical protein ymaB] [GN:ymaB ] [OR:Bacillus subtilis] [DB:pir2] 
>gp: [GI:e218063:gll54634] [LN : BSNRDYMA] [AC:Z68500] [PN: YmaB] [GN:ymaB] [FN:unknown] 
[OR: Bacillus subtilis] [DB :genpept-bctl] [DE : B . subtilis cwlC, nrdE, nrdF, ymaA and ymaB 

genes.] [NT : no similarities, cannot be inactivated] [SP:P50619] [LE:4450] [RE:5070] 
[DI: direct] >gp: [GI : ell83399 :g2634124] [LN: BSUB0010] [AC : Z99113 : AL009126] [GN:ymaB] 
[FN:unknown] [OR:Bacillus subtilis] [DB : genpept - be t 1 ] [DE:Bacillus subtilis complete 

genome (section 10 of 21): from 1781201to 2014980.] [LE:90169] [RE:90789] [DI:direct] 

NT AA 

ORF Name NT ID AAID — , — . , score probability 

LengthLength c *~ 



AI7503000992_489i876_f3_45 | [2115 | [ 5888 || 141 | |46 | 



Description 
NO-HIT 



490 



NT AA 

ORF Name NTID AAID Le ~ thLe ^ t h score probability 

|AI7503000992J>0uub2_c3J.i8 | [ 2117 | [ 5889 | |222 || 440 | |1.8e-41 — | 

Description 

spr [LNrRADC_BACSU] [AC:Q02170] [GN : YSXA] [ORrBACILLUS SUBTILIS] [DE:DNA REPAIR PROTEIN 
RADC HOMOLOG (ORFB) ] [SP:Q02170] [DB : swissprot] >pir: [LN:B45239] 

[AC:B45239 :B45240 : 139881 :A69987 ] [ PN : DNA repair protein homolog ysxA] [GN:ysxA ] 
[CL : DNA repair protein radc] [ORrBacillus subtilis] [DB:pir2] >gp : [GI :gl42854] 
[LN : BACDIVREG] [AC:M96343] [GNrorfB] [ORrBacillus subtilis] [SR:Bacillus subtilis 
(substrain PY79, strain W168) DNA] [DB :genpept-bctl] [DErBacillus subtilis orfA, orfB, 

mreB, mreC, mreD, minC, and minDgenes, complete coding regions.] [NT : homologous to E. 

coli radC gene product and to] [LE:762] [RE:1457] [DIrdirect] >gp : [GI : gl43162] 

[LN : BACMAFMREB] [AC:L08793] [ORrBacillus subtilis] [SRrBacillus subtilis (strain W168) 

DNA] [DB:genpept-bctl] [DE: Bacillus subtilis maf gene, complete cds; orfB, complete 

cds; mreBgene, 5' end.] [NTrputative] [LE:1557] [RE:2252] [DI:direct] 

>gp: [GI :ell84053:g2 635269] [LN : BSUB0015] [AC : Z99118 : AL009126] [GN:ysxA] [FNrunknown] 
[ORrBacillus subtilis] [DB : genpept-bctl] [DE: Bacillus subtilis complete genome (section 
15 of 21): from 2795131to 3013540.] [NTrsimilar to DNA repair protein] [SP:Q02170] 
[LE : 65971] [RE : 66666] [DI : complement] 



NT AA 

ORF Name NTID AAID — _ — . score probability 
Length Length 



M!7S03000992_S26877B_c3_i06 | [2118 | [5890 jpSP~||31S ||13fl ||8.Se-07 



Description 

pir: [LN:H69986] [AC:H69986 ] [PN : hypothetical protein ysoA] [GN : ysoA ] [ORrBacillus 
subtilis] [DB:pir2] >gp : [GI : el 184073 :g2 635289] [LN : BSUB0015] [AC : Z99118 : AL009126] 
[GN: ysoA] [FN:unknown] [ORrBacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 
complete genome (section 15 of 21): from 2795131to 3013540.] [LE:91788] [REr92720] 
[DIr complement] >gp: [GI : ell65372 rgl770073] [LN:BSZ75208] [AC:Z75208] [PN : hypothetical 
protein] [GN : ysoA] [OR:Bacillus subtilis] [DB r genpept-bctl] [DE r B . subtilis genomic 
sequence 89009bp.] [NTrunknown function; putative] [LE:77187] [RE:78119] [DI:direct] 

— 

NT AA 

ORF Name NTID AAID — — score probability 

Length Length c -L 



■I7S03000992_B860827_c3_liO j [2119 | [ 5891 || 134T ] | 448 | pT7T]| 4 . 8e-119 



Description 

sp : [LN : HEM1_BACSU] [AC:P16618] [GN :HEMA] [OR:BACILLUS SUBTILIS] [EC:1.2.1.-] 
[DE : GLUTAMYL -TRNA REDUCTASE, (GLUTR) ] [SP:P16618] [DB : swissprot] >pir : [LN : A35252] 
[AC:A35252 :C69639 ] [PN : glutamyl -tRNA reductase, hemArhemA protein] [GN:hemA ] 
[CL: glutamyl -tRNA reductase] [ORrBacillus subtilis] [EC :1.2 . 1. -] [DB:pir2] 
>gp: [GI:gl43035] [LN r BACHEMAXC] [AC rM57676 rM32130] [PN : NAD (P) H : glutamyl- transfer RNA 
reductase] [GN:hemA] [ORrBacillus subtilis] [SR : B . subtilis DNA] [DB r genpept-bctl] 
[DErBacillus subtilis hemAXCDBL gene cluster.] [NTrThe product of this hemA gene is not 
an] [LEr232] [RErl599] [DIrdirect] >gp r [GI : ell84066 : g2635282] [LN r BSUB0015] 
[AC:Z99118 rAL009126] [PNr glutamyl -tRNA reductase] [GNrhemA] [FNrporphyrin biosynthesis] 
[ORrBacillus subtilis] [DB : genpept-bctl] [EC:1.2.1.-] [DErBacillus subtilis complete 
genome (section 15 of 21) : from 2795131to 3013540.] [SP.-P16618] [LEr81736] [REr83103] 
[DIr complement] >gpr [GI r el 16 53 81 rgl770081] [LNrBSZ75208] [ACrZ75208] 
[PNrNAD(P)Hrglutamyl-transfer RNA reductase] [GNrhemA] [ORrBacillus subtilis] 
[DBr genpept-bctl] [EC : 2 . 3 . 1 . 37] [DE : B . subtilis genomic sequence 89009bp.] 
[NT rNAD (P) H r glutamyl -t -RNA reductase of the C-5 pathway] [SPrP16618] [LEr 86804] 
[REr88171] [DI rdirect] 



491 



ORF Name 



NTID 



NT AA 
AAID — _ — , , score 
Length Length 



probability 



F 



7503000992 6B4bJ^2 r2 2<> 



2120 | [5892 ||219~]|72 |p ||0 , 034 



Description 

gp: [GI:el358367:g3979946] [LN : CEY18D10A] [AC : AL034393] [GN: Y18D10A. 14] 
[OR:Caenorhabditis elegans] [DB :genpept-invl] [DE : Caenorhabditis elegans cosmid 

Y18D10A, complete sequence.] [LE : 113813 : 114013 : 114203] [RE : 113961 : 114112 : 114448] 
[DI : direct Join] 



ORF Name 



NT AA 

NTID AAID — - — , , score probability 
Length Length ^ L 



|Ai7503000992 7050319 c2 90 



2T2T" 



|5893 |P^~| 



ITS" 



8 .4e-19 



Description 

sp: [LN:RL3 5_BACSU] [AC:P55874] [GN : RPMI] [OR:BACILLUS SUBTILIS] [DE:50S RIBOSOMAL 
PROTEIN L35] [SP:P55874] [DB : swissprot] 



ORF Name 

|AI7503000992_7283437_ciJj9 

Description 
NO-HIT 



2122 



NT AA 
Len 



NTID AAID Le ^ thLe ^ th score probability 
^^~||152 | 



ORF Name 



NTID 



NT 



AA 



E 



17503000992 806510 Cl 86 



fTTTT 



^ Len^thLen^th ^^ probability 
[5895 |[7T^~|| 237 | | 189 || V.0e-15 



Description 

sp: [LN:LEP3_BACSU] [AC:P15378] [GN:COMC] [OR:BACILLUS SUBTILIS] [EC : 3 . 4 . 99 . - ] [DE : (LATE 
COMPETENCE PROTEIN COMC) ] [SP:P15378] [DB : swissprot] >pir : [LN: A33490] 
[AC:A33490:E40646:F40646:B69602 ] [PN:type IV prepilin peptidase, : genetic 
transformation late competence protein ComC] [GN:comC ] [CL:type IV prepilin peptidase] 
[OR:Bacillus subtilis] [EC : 3 . 4 . 99 . - ] [DB:pir2] >gp : [GI : gl42704] [LN:BACCOMC] 
[AC:M30805] [PN:late competence protein] [GNrcomC] [OR:Bacillus subtilis] [SR:Bacillus 
subtilis (strain IS75) DNA] [DB : genpept-bctl] [DE:Bacillus subtilis late competence 
protein (comC) gene, completecds . ] [LE:819] [RE: 1565] [DI: direct] 

>gp: [GI:ell84056:g2635272] [LN : BSUB0015] [AC : Z99118 : AL009126] [PN : DNA- binding protein] 
[GN:comC] [FN:required for the processing and translocation] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE:Bacillus subtilis complete genome (section 15 of 21) : from 
2795131to 3013540.] [SP:P15378] [LE:68557] [RE:69303] [DI : complement] 



ORF Name 



NTID 



AAID 



[AI7503000992 820253 tl 15 



NT AA 
LengthLength 



score 



SB" 



probability 
i.ie-ifi 



Description 

gp: [GI:ell65375:gl770076] [LN : BSZ752 08] [AC:Z75208] [PN : hypothetical protein] [GN:ysoC] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE :B . subtilis genomic sequence 89009bp.] 
[NT:unknown function; putative] [LE:80592] [RE:81206] [DI : complement] 



ORF Name 



NTID 



AAID 



IAI7503000992 978562 cl 79 



NT AA 
LengthLength 
[5897 ||g^T [ |285 



score probability 
1261 ||1 . 8e-128 



Description 

sp: [LN:HEM2_STAAU] [AC:P50915] [GN : HEMB] [OR : STAPHYLOCOCCUS AUREUS] [EC : 4 . 2 . 1 . 24] 
[DE : SYNTHASE) (ALAD) (ALADH) ] [SP:P50915] [DB : swissprot] >gp : [GI : g2 589183] 
[LN:SAU89396] [AC:U89396] [PN : d- aminolevulinic acid dehydratase] [GN:hemB] 
[OR staphylococcus aureus] [DB : genpept-bct2] [EC : 4 . 2 . 1 . 24] [DE : Staphylococcus aureus 
hemCDBL gene cluster: porphobilinogendeaminase (hemC) , uroporphyrinogen III synthase 
(hemD) ,d- aminolevulinic acid dehydratase (hemB) and GSA-l-aminotransf erase (hemL) genes, 
complete cds . ] [LE:1838] [RE: 2809] [DI:direct] 
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ORF Name 

[A17503 000993_lU266 87b_C2_761 



NT ID 



NT AA 
AAID — _ — . score 
LenqthLength 



probability 



Description 
NO-HIT 



j |212S | [5898 |[T55~1|5l "| 



ORF Name 



NTID 



TTZT 



AAID 



NT AA 
Length Length 
[5899 [pre | |167 



score probability 
3.1e-28 



[AI7503 000993_i04<53_c2_803 

Description 

Sp: [LN : YTXG_BACSU] [AC:P40779] [GN : YTXG] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 15.7 
KD PROTEIN IN MURC-AROA INTERGENIC REGION (ORF1) ] [SP:P40779] [DB : swissprot] 
>pir: [LN:D70003] [AC : D70003 : S71001 ] [PN:general stress protein homolog ytxG] [GN:ytxG 
] [OR : Bacillus subtilis] [DB:pir2] >gp : [GI :g556015] [LN : BACUNAM] [AC:L31845] 
[ORrBacillus subtilis] [DB :genpept-bctl] [DErBacillus subtilis 

UDP-N-acetylmuramate-alanine ligase gene, partialcds, and 3 ORF's.] [NT :0RF1] [LE:1211] 
[RE:1642] [DI:direct] >gp: [GI : ell85851 : g2635462] [LN : BSUB0016] [AC : Z99119 : AL009126] 
[GNiytxG] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DErBacillus subtilis 
complete genome (section 16 of 21) : from 2997771to 3213410.] [NT:alternate gene name: 
csb40; similar to general] [SP:P40779] [LE: 48897] [RE: 49328] [DI : complement] 
>gp: [GI:g2293217] [LN: AF008220] [AC : AF008220] [PN: YtxG] [GNrytxG] [OR:Bacillus 
subtilis] [DB:genpept-bct2] [DErBacillus subtilis rrnB-dnaB genomic region.] 
[LE : 131098] [RE r 13152 9] [DI r direct] 



ORF Name 



NTID 



„ TO NT AA 

AAID — _ — , , score 

Length Length 



probability 



Al7b03000993_10548383_tl_3 

Description 

NO-HIT 



J pi28 | |5900 |pT7 | [78 I 



ORF Name 


NTID 


_ _ __ NT AA 

AAID , , 

Length Length 


score 


probability 


AI7b03000993_i0564375_t3_S10 


| 2129 


| |5901 |123 | |4 0 | 






Description 
NO-HIT 










ORF Name 


NTID 


7V7vTrN NT AA 

AAID , , 

LengthLength 


score 


probability 


A17b03000993_1056693_c2_793 


2130 


|5902 ||110i||366 | 


|774 


7.1e-77 | 



Description 

pirr [LN:B69998] [AC:B69998 ] [PN: endo-1 , 4-beta-glucanase homolog ytoP] [GNrytoP ] 
[CLr thermophilic aminopeptidase I alpha chain] [ORrBacillus subtilis] [DBrpir2] 
>gpr [GI:ell85859:g2635470] [LN r BSUB0016] [AC : Z99119 r AL009126] [GNrytoP] [FNrunknown] 
[ORrBacillus subtilis] [DB r genpept-bctl] [DErBacillus subtilis complete genome (section 
16 of 21) r from 2997771to 3213410.] [NTrsimilar to endo-1, 4 -beta-glucanase] [LEr56559] 
[REr57632] [DI r complement] >gp r [GI r g2293210] [LN : AF008220] [AC r AF008220] [PNrYtoP] 
[GNrytoP] [ORrBacillus subtilis] [DB : genpept -bct2 ] [DErBacillus subtilis rrnB-dnaB 
genomic region.] [NTrsimilar to hypothetical protein f356 from E. coli] [LErl22794] 
[RE:123867] [DI: direct] 



ORF Name 



NTID 



AAID — 



IA17&03000993 10588877 tl b2 



][ 



fZTTT 



Description 
NO-HIT 



NT AA 
LengthLength 
[5903 |p^~| [45 



score probability 
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ORF Name 



NT AA 

NTIP AAI5 Le ^thLe^th ^^ probability 



117503000993 10667002 tl ill 



Description 
NO-HIT 



J [2132 | [5904 [[155 1 



ORF Name 



NTID 



NT AA 
AAID — _ — , score 
Length Length 



probability 



^17503000993 109430 r.3 4«b 



Description 
NO-HIT 



[2133 | [5905 ||144 [ [47 | 



ORF Name 



NTID 



NT 



AA 



F 



17503000993 10969427 13 448 



Description 
NO-HIT 



AAID 

LengthLength 

[2134 | [5906 [[155 | [51 [ 



score 



probability 



ORF Name 



NT AA 
NTID AAID — _ „ — , , score 
LengthLength 



probability 



IAI7503000993 112525 £3 545 



Description 
NO-HIT 



[2135 | [5907 |pn~ |p" 



ORF Name 



NT AA 

NTID AAID — „. — , - score probabil ity 
LengthLength 



[AI7503000993 116337 12 251 



5908 "11627 I 



T99" 



G.le-16 



Description 

gp: [GI:g3043872] [LN: LLU95837] [AC:U95837] [PN : transmembrane protein Tmp3] 
[OR:Lactococcus lactis] [DB : genpept-bct2] [DE : Lactococcus lactis transmembrane protein 
Tmp3 gene, partial cds.] [NT : PBP1A homolog; identified as a fusion to a signal] [LE:<1] 
[RE : 588] [DI : complement] 



ORF Name 



VI7503000993 11881325 13 56S 



NTID 



TTS NT ^ 

AAID — — , score 
LengthLength 



probability 



fZTTT 



[5909 pg" 



Description 
NO-HIT 



ORF Name 



NTID 



MTT ^ NT AA 

AAID — _ — , score 

LengthLength 



IAI7503000993 1203827 c3 893 



J [2138 [ [5910 [[539 [[212 [ 



T7T 



probability 
3.4e-45 



Description 

sp: [LN:RISA_ACTPL] [AC:P50854] [GN : RIBE : RIBB] [OR : ACTINOBACILLUS PLEUROPNEUMONIAS ] 
[SR: , HAEMOPHILUS PLEUROPNEUMONIAS] [EC: 2. 5. 1.9] [DE : RIBOFLAVIN SYNTHASE ALPHA CHAIN,] 
[SP:P50854] [DB : swissprot] >gp : [GI :gll73517] [LN: APU27202] [AC:U27202] [PN : riboflavin 
synthase alpha subunit] [GN:ribB] [OR : Actinobacillus pleuropneumoniae] 
[DB:genpept-bctl] [DE : Actinobacillus pleuropneumoniae riboflavin biosynthesis 
operon, ribof lavin-specif ic deaminase (ribG) , riboflavin synthase alphasubunit (ribB) , 
bifunctional GTP cyclohydraseII/3 , 4-dihydroxy-2-butanone- 4 -phosphate synthase (ribA) , 
andribof lavin synthase beta subunit (ribH) genes, complete cds.] [NT:lumazine synthase; 
similar to Bacillus subtilis] [LE:1685] [RE: 2332] [DI: direct] 



494 



ORF Name 



NTID 



NT AA 
AAID — _ T — . , score 
LengthLength 



|A17b03000993 1209417 c3 9b9 



probability 
] [2139 | [5911 ||T7uT]p57 ~~||797 p.fee-79 



Description 

sp: [LN:PHOR_BACSU] [AC:P23545] [GN:PHOR] [OR:BACILLUS SUBTILIS] [EC:2.7.3.-] 
[DE : ALKALINE PHOSPHATASE SYNTHESIS SENSOR PROTEIN PHOR,] [SP:P23545] [DB : swissprot] 
>pir: [LN:A27650] [AC : A27650 : G69676 ] [PN : phosphate response regulator histidine kinase 
phoR] [GN:phoR ] [OR : Bacillus subtilis] [DB:pir2] >gp : [GI :gl43331] [LN : BACPHORP] 
[AC:M23549] [PN:alkaline phosphatase regulatory protein] [GN:phoR] [OR:Bacillus 
subtilis] [SR:Bacillus subtilis DNA] [DB :genpept-bctl] [DE:Bacillus subtilis alkaline 
phosphatase regulatory protein (phoPgene, 3' end and phoR gene, complete cds).] [LE:85] 
[RE:1824] [DI:direct] >gp : [GI :e!184159:g2 635375] [LN : BSUB0015] [AC : Z99118 : AL009126] 
[PN: two -component sensor histidine kinase] [GN:phoR] [FN: involved in phosphate 
regulation] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus subtilis complete 
genome (section 15 of 21): from 2795131to 3013540.] [SP:P23545] [LE:180011] [RE:181750] 
[DI: complement] >gp : [GI :g2293271] [LN : AF008220] [AC : AF008220] [PN:signal transduction 
protein kinase] [GN :phoR] [OR:Bacillus subtilis] [DB :genpept-bct2 ] [DE:Bacillus 
subtilis rrnB-dnaB genomic region.] [LE: 201317] [RE: 203056] [DI: direct] 



ORF Name 



E 



17503000993 125880 t2 351 



Description 
NO-HIT 



NT 



AA 



NTID AAID — — . - score probabili ty 
LengthLength c L - 



2140 | |5912 ||309 | [102 [ 



ORF Name 



U7503000993 1284381 cl 552 



NTID 



NT 



AA 



AAID . , — 



LengthLength 



score 



[2141 | [ 5913 |[7n~| |246 [ | 297 | [ 



probability 
|2.5e-26 



Description 

sp : [ LN : TAL_MET JA] [AC:Q58370] [GN : TAL : MJ0960] [OR : METHANOCOCCUS JANNASCHII] 
[EC:2.2.1.-] [DE : TRANSALDOLASE-LIKE PROTEIN,] [SP:Q58370] [DB : swissprot] 
>pir: [LN:H64419] [AC:H64419 ] [PN : transaldolase , ] [CL:Bacillus subtilis 23K 
phosphoprotein orfU] [OR : Methanococcus jannaschii] [EC:2.2.1.2] [DB:pir2] 
[MP:REV892437-891784 ] >gp : [GI : gl591624] [LN:U67539] [AC : U67539 : L77117] 
[PN: transaldolase] [GN:MJ0960] [OR : Methanococcus jannaschii] [DB :genpept-bct2] 
[DE : Methanococcus jannaschii section 81 of 150 of the complete genome.] [NT: similar to 
SP:P19669 PID:853766 GB:AL009126 percent] [LE:7990] [RE:8643] [DI : complement] 



ORF Name 



NTID 



_ _ _ NT AA 

AAID — — score 

LengthLength 



probability 



|A^7503000993 12894378 t2 347 



] [2142 [ [ 5914 | |177 "| [58 | 



Description 
NO-HIT 



ORF Name 

|AI7503000993_13001537_i3_444 

Description 

NO-HIT 



NT AA 
NTID AAID T — 1-T — M score 
LengthLength 

[2143 | [ 5915 | prr~| p | 



probability 



ORF Name 



A17503000993_130890b2_t3_538 

Description 

NO-HIT 



NT AA 
NTID AAID — _ — , score 
LengthLength 

[2144 | [5915 ||141~||46 | 



probability 
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ORF Name 



IAI7503000993 13b2042 c2 777 



Description 
NO-HIT 



NT AA 
NT ID AAID — . — . , score 
Length Length 



probability 



] p 145 | | 5917 II 1 ** 0 "! P I 



ORF Name 



^7503000993 13678452 ci 628 



NTID AAID , . , . — 



2146 I 



NT AA 
LengthLength 



score 



probability 
|2.9e-41 



Description 

sp: [LN:ECSB_BACSU] [AC:P55340] [GN : ECSB : PRST] [OR : BACILLUS SUBTILIS] [DE : PROTEIN ECSB] 
[SP:P55340] [DB : swissprot] >pir : [LN:G69619] [AC:G69619 ] [PN:ABC transporter (membrane 
protein) ecsB] [GN : ecsB ] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : el83450 :gll77254] 
[LN : BSECSABCG] [AC:X87807] [PN : hypothetical EcsB protein] [GN : ecsB] [OR:Bacillus 
subtilis] [DB:genpept-bctl] [DE : B . subtilis ecsA, ecsB, and ecsC genes.] [SP:P55340] 
[LE:984] [RE:2210] [Dlidirect] >gp : [GI :ell83007:g2 633341] [LN : BSUB0006] 
[AC:Z99109 :AL009126] [PN:ABC transporter (membrane protein)] [GN : ecsB] [FN: regulates 
both components of the protein] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 6 of 21): from 999501 tol209940.] [NT : alternate gene 
name: prsT, yhaC] [SP:P55340] [LE:78149] [RE:79375] [DI:direct] 

>gp: [GI:e324951:g2226114] [LN : BSY14077] [AC:Y14077] [PN: Hypothetical protein] [GN:yhaC] 
[OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subitlis 10.6 Kb chromosomal DNA: 
glyB-prsA region.] [NT : Identified as ecsB. Hypothetical integral membrane] [SP:P55340] 
[LE : 1538 ] [RE : 2 764] [DI : complement] 



NT AA 

ORF Name NTID AAID — _„ — L score probability 
LengthLength -L - 



|&I7S03000993_1372233fl_c3J)43 | [2147 [ [5919 |pT7 — [ | | 505 | |2.3e-48 1 

Description 

pir: [LN:F69992] [AC:F69992 ] [PN:thiol peroxidase homolog ytgl] [GN:ytgI ] [CL:thiol 
peroxidase] [OR:Bacillus subtilis] [DB:pir2] >gp : [GI : ell85822 :g2635433] [LN : BSUB0016] 
[AC:Z99119:AL009126] [GN:ytgI] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] 
[DE:Bacillus subtilis complete genome (section 16 of 21): from 2997771to 3213410.] 
[NT:similar to thiol peroxidase] [LE:18998] [RE:19501] [DI : complement] 
>gp: [GI:g2293238] [LN : AF008220] [AC : AF008220] [PN:YtgI] [GN:ytgI] [OR:Bacillus 
subtilis] [DB:genpept-bct2] [DE:Bacillus subtilis rrnB-dnaB genomic region.] 
[NT similarity to tagD protein from V.cholerae] [LE:160926] [RE:161429] [DI:direct] 



NT AA 

ORF Name NTID AAID — «^ — , . score probability 
LengthLength c L - 



[Ai:7503000953_i37233ifl_c3_545 | [2148 | [ 5920 |pT^] |382 | | 1230 | |T~?e -i2S — 

Description 

pir: [LN:JE0388] [AC:JE0388 ] [PN:alanine dehydrogenase,] [OR : Enterobacter aerogenes] 
[EC:1.4.1.1] [DB:pir3] >gp : [GI : dl 04 12 7 9 :g4803749] [LN: AB013821] [AC : AB013821] 
[PN:alanine dehydrogenase] [GN:aladh] [OR : Enterobacter aerogenes] [SR : Enterobacter 
aerogenes DNA] [DB : genpept-bctl] [EC: 1.4. 1.1] [DE : Enterobacter aerogenes aladh gene for 
alanine dehydrogenase, complete cds . ] [LE:174] [RE: 1307] [Dlidirect] 
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ORF Name 

IAI7503000993 13797912 c3 891 



NTID 



NT AA 
AAID — « — , , score 
Length Length 



] pi49 | |592i ||i524 | prr 



TT5~ 



probability 
I.Ve-05 



Description 

pir: [LN:D64924] [AC:D64924 ] [PN : hypothetical protein bl668] [OR : Escherichia coli] 
[DB:pir2] >gp: [GI:gl549279] [LN: ECU68703 ] [AC:U68703] [OR : Escherichia coli] 
[DB :genpept-bctl] [DE : Escherichia coli K-12 MG1655 genome, ribC-pykF region.] 
[NT: hypothetical protein] [LE:4535] [RE:6139] [DI:direct] >gp : [GI :gl787957] 
[LN:AE000262] [AC : AE000262 :U00096] [PN:orf, hypothetical protein] [GN:bl668] [FN:orf; 
Unknown] [OR : Escherichia coli] [DB : genpept-bct2] [DE : Escherichia coli K-12 MG1655 
section 152 of 400 of the completegenome . ] [NT:o534; This 534 aa ORF is 38 pet 
identical (6 gaps)] [LE:872] [RE:2476] [DI: direct] 



ORF Name 



NTID 



NT 



AA 



AAID _ . , _ 



LengthLength 



score 



IAI7503000993 13852187 tl 136 



] 



[2TT5TT 



|5922 |[T35 | fST" 



TIT 



probability 
|4.5e-0S 



Description 

pir: [LN:S77632] [AC : S77632 : S52761 ] [PN:probable integrase] [GN:int ] 
[OR : Staphylococcus aureus phage phi-13] [DB:pir2] >gp : [GI :g758229] [LN: PHI13INT] 
[AC:X82312] [PN : integrase] [GN:int] [FN: integration of phi-13 in S. aureus genome] 
[OR bacteriophage phi-13] [DB :genpept-phg] [DE : Bacteriophage phi-13 integrase gene.] 
[LE:461] [RE:1498] [DI:direct] 



ORF Name 



NTID 



AAID 



NT 



AA 



LengthLength 



score probability 



|Al7b03000993 13939027 t3 560 



Description 
NO-HIT 



-| [2151 | [5923 |pW~||61 | 



ORF Name 



NT 



AA 



NTID AAID — . T — _ 
LengthLength 



score 



117503000993 14251933 12 239 



[5924 | 



TFT 



probability 
i.5e-8i 



Description 

pir : [LN:A37146] [AC : A3 7146 : A44901 : S11354 :C6 9699: 139962 ] [PN : ribosomal protein 
S4:ribosomal protein BS4 (rpsD) ] [GN : rpsD ] [CL : Escherichia coli ribosomal protein S4] 
[OR:Bacillus subtilis] [DB:pir2] >gp : [GI : gl43467] [LN: BACRPSD] [AC : M59358 : M34718] 
[PN:ribosomal protein S4] [GN : rpsD] [OR:Bacillus subtilis] [SR:B . subtilis (strain 168, 
isolate BR151) DNA] [DB :genpept-bctl] [DE : B . subtilis ribosomal protein S4 gene, 
complete cds and tyrosyl tRNAsynthetase (tyrS) gene, 3' end.] [LE:756] [RE:1358] 
[DI:direct] >gp : [GI : el 185839 :g2 635450] [LN : BSUB0016] [AC : Z99119 : AL009126] [PN : ribosomal 
protein S4 (BS4 ) ] [GN: rpsD] [OR:Bacillus subtilis] [DB :genpept-bctl] [DE:Bacillus 
subtilis complete genome (section 16 of 21): from 2997771to 3213410.] [SP:P21466] 
[LE:37035] [RE:37637] [DIrdirect] >gp : [GI : g22 93 319] [LN : AF008220] [AC : AF008220] 
[PN: ribosomal protein S4] [GN : rpsD] [OR:Bacillus subtilis] [DB : genpept-bct2] 
[DE: Bacillus subtilis rrnB-dnaB genomic region.] [LE: 142790] [RE: 143392] 
[DI : complement] 



ORF Name 



NTID 



NT 



AA 



AAID — 

LengthLength 



score 



IAI7503000993 14460877 c3 963 



][ 



[5925 [ps - 1|45 |pT 



probability 



Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT :0RF1] 

[LE : 1101] [RE : 1922] [DI : complement] 
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ORF Name 

IAI7503000993 14460882 cl 649 



NTID 



NT AA 
AAID — 1 _. T — , score 
LengthLength 



probability 



[2154 | [5925 |[T7T~| p 1 p7~ || 6 . 

Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB :genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds . ] [NT : ORF1] 

[LErllOl] [RE: 1922] [DI : complement] 



ORF Name 



NTID 



NT AA 
AAID — , — . score 
LengthLength 



probability 



E 



17503000953 14460932 cl 612 



[5927 jprrT — [ |58 |pTT-~|p.3e-i7 



Description 

gp: [GI:gl022726] [LN : SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR: Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE: Staphylococcus haemolyticus IS1272 ORF1 and ORF2 genes, completecds.] [NT : ORF1] 

[LE : 1101] [RE : 1922 ] [DI : complement] 



ORF Name 



NTID 



NT AA 
AAID — . T — , , score 
LengthLength 



1AI7503000993 14460932 12 226 



][ 



5928 11168 I 



probability 
1.3e-15 



Description 

gp: [GI:gl022726] [LN: SHU35635] [AC:U35635] [PN:unknown] [OR : Staphylococcus 
haemolyticus] [SR : Staphylococcus haemolyticus strain=Y176] [DB : genpept-bctl] 

[DE: Staphylococcus haemolyticus IS12 72 ORF1 and ORF2 genes, completecds.] [NT : ORF1] 

[LE:1101] [RE: 1922] [DI : complement] 



ORF Name 



NT AA 
NTID AAID — _ ^ — , , score 
LengthLength 



probability 



LA17503000993 14656327 c2 751 



] [2157 | [5929 |p^"| |207 | [7^~||1 . 6e-79 



Description 

gp: [GI:gl916729] [LN : AF134 905] [AC : AF134905 :U76550] [PN:CadD] [GN: cadD] 
[OR staphylococcus aureus] [DB:genpept-bct2] [DE : Staphylococcus aureus plasmid pRWOOl 
CadD (cadD) gene, completecds.] [NT:confers low level cadmium resistance] [LE:2328] 
[RE:2957] [DI:direct] 



ORF Name 



IAI7503000993 14658152 13 589 



Description 
NO-HIT 



ORF Name 



IAI7503000993 14742937 c2 774 



Description 
NO-HIT 
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AAID — — _ score 
LengthLength 



probability 
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75" 
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NT AA 

0RF Name ™ ^ L e^thLe^th ^^ probability 



|AI7503000993_14877316_tlJ>4 | [2160 | [ 5932 || 483 "| pP [| 407 | |b.be-38 | 

Description 

sp : [LN : YEBR_ECOLI] [AC : P76270 : 007976 :O07978] [GN : YEBR] [OR : ESCHERICHIA COLI] 
[DE: HYPOTHETICAL 20.3 KD PROTEIN IN PRC-PPHA INTERGENIC REGION] 

[SP:P76270:O07976:O07978] [DB : swissprot] >pir : [LN : H64944] [AC:H64944 ] [PNrprobable 
membrane protein bl832] [CL : hypothetical protein YKL069w] [OR : Escherichia coli] 

[DB:pir2] >gp : [GI : dl016364 :gl736473] [LN:D90826] [AC :D90826 : ABO 01 34 0] [GN:YKL069W, 
YKL340] [OR: Escherichia coli] [SR: Escherichia coli (strain:K12) DNA, clone_lib : Kohara 
lambda minise] [DB : genpept-bctl] [DE:E.coli genomic DNA, Kohara clone #335(40.9-41.3 
min.).] [NT:ORF_ID:o335#13; similar to [SwissProt Accession] [LE:16795] [RE:17346] 

[DI : complement] >gp: [GI :dl 016369 :gl 736479] [LN:D90827] [AC :D9082 7 : ABO 01 340] 

[GN:YKL069W, YKL340] [OR : Escherichia coli] [SR : Escherichia coli (strain:K12) DNA, 
clone_lib: Kohara lambda minise] [DB : genpept-bctl] [DE:E.coli genomic DNA, Kohara clone 
#336(41.2-41.6 min.).] [NT:ORF_ID:o33 5#13 ; similar to [SwissProt Accession] [LE:2748] 

[RE:3299] [DI : complement] >gp: [GI :gl788136] [LN : AE000277] [AC : AE0002 77 :U00096] [PN:orf , 
hypothetical protein] [GN:bl832] [FN:orf; Unknown] [OR : Escherichia coli] 

[DB:genpept-bct2] [DE : Escherichia coli K-12 MG1655 section 167 of 400 of the 
completegenome. ] [NT: f 183; residues 72-127 are 57 pet identical to] [LE:7076] [RE: 7627] 

[DI : complement] 

NT AA 

ORF Name NTID AAID — — ^ score prob ability 
LengthLength c 



A!i:7g03000993_i4882928_c3_923 | |2igl | [5933 || 10Sfl | |365 | [ 1279 || 2 . 2e-i "7Tr 



Description 

gp: [GI:e220317:gll77684] [LN:SXCCPA] [AC:X95439] [PN : chorismate mutase] [GN: aroA] 
[OR staphylococcus xylosus] [DB : genpept-bctl] [EC : 5 . 4 . 99 . 5] [DE : S . xylosus aroA, ccpA, 
acuC and acuA genes.] [LE:<1] [RE: 807] [DI:direct] 

NT AA 

ORF Name NTID AAID — _ „ — , , score prob ability 
LengthLength c *- 

A!T7g03000 993_15550303_c3J182 | [2152 | | b934 | p2TT7 — 1 | 68 | 



Description 
NO-HIT 



NT AA 

ORF Name NTID AAID Le ~ thLe ~ th score probability 

AI7B03000953_1631Si_c3_aS6 | [2163 | [ 5935 |pTF] |415 | [ 1800 || i . 3e-18b — | 

Description 

sp : [LN : METK_STAAU] [AC:P50307] [GN :METK] [OR : STAPHYLOCOCCUS AUREUS] [EC:2.5.1.6] 
[DE : ADENOSYLTRANSFERASE) (ADOMET SYNTHETASE)] [SP:P50307] [DB : swissprot] 
>gp: [GI:gl020317] [LN : SAU36379] [AC:U36379] [PN : S-adenosylmethionine synthetase] 
[FN: catalyses the synthesis of SAM] [OR: Staphylococcus aureus] [DB : genpept-bctl] 
[DE: Staphylococcus aureus S-adenosylmethionine synthetase gene , complete cds . ] [NT : SAM 
synthetase] [LE:212] [RE: 1405] [DI: direct] 



NT AA 

ORF Name NTID AAID — , — score probability 
LengthLength c L - 



AI7S03000993jL6B33442_c2J73S | [2154 | [5936 | p2~ | [ 173 | [5?5~|| 1 ■ Oe-52 



Description 

gp: [GI:gl381681] [LN: BSU58864] [AC:U58864] [PN:CspR] [GN:cspR] [OR:Bacillus subtilis] 
[SR:Bacillus subtilis strain=JH642] [DB : genpept-bctl] [DE:Bacillus subtilis methylase 

homolog (cspR) gene, complete cds.] [NT:methylase homolog] [LE:573] [RE: 1046] 
[DI : direct] 
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ORF Name 



NT ID 



_ _ _ _^ NT AA 

AAID — — .score 

LengthLength 



IAI7503000993 16586012 12 308 



j [2165 | [5937 ||5"85 | [155 [|456 | f 



probability 
|3.5e-43 



Description 

pir: [LN:E69999] [AC:E69999 ] [PN : hypothetical protein ytqB] [GNrytqB ] [OR:Bacillus 

SUbtilis] [DB:pir2] >gp : [GI : ell85922 :g2635533] [LN: BSUB0016] [AC : Z99119 : AL009126 ] 
[GN:ytqB] [FN:unknown] [OR:Bacillus subtilis] [DB : genpept-bctl] [DE:Bacillus subtilis 

complete genome (section 16 of 21) : from 2997771to 3213410.] [LE:122275] [RE:122859] 
[DI: direct] >gp : [GI : g2293301] [LN : AF008220] [AC : AF008220] [PN: YtqB] [GN:ytqB] 
[OR:Bacillus subtilis] [DB : genpept-bct2] [DE:Bacillus subtilis rrnB-dnaB genomic 

region.] [LE:57568] [RE:58152] [DI : complement] 



ORF Name 



^7503000993 155908 t2 353 



NT ID 



^ m NT AA 

AAID — — score 

LengthLength 



[5938 |pT7 | |168 | fTST" 



probability 
3 . 8e-14 



Description 

sp : [LN : YHGC_BACSU] [AC:P38049] [GN:YHGC] [OR:BACILLUS SUBTILIS] [DE : HYPOTHETICAL 18.8 
KD PROTEIN IN ECSC-PBPF INTERGENIC REGION] [SP:P38049] [DB : swissprot] >pir : [LN : B40614 ] 
[AC:B40614 :F69832 ] [PN : conserved hypothetical protein yhgC : hypothetical protein X 
(pbpF 5* region)] [GN:yhgC ] [OR:Bacillus subtilis] [DB:pir2] >gp: [GI:g304160] 
[LN : BACPBPF] [AC:L10630] [OR:Bacillus subtilis] [SRrBacillus subtilis (strain W168) 
DNA] [DB: genpept-bctl] [DE:Bacillus subtilis penicillin-binding protein (pbpF) gene, 5' 
end.] [NT: product unknown] [LE:247] [RE:747] [DI : complement] >gp : [GI : ell83012 : g2633346] 
[LN:BSUB0006] [AC : Z99109 : AL009126 ] [GN:yhgC] [FN:unknown] [OR:Bacillus subtilis] 
[DB:genpept-bctl] [DE: Bacillus subtilis complete genome (section 6 of 21) : from 999501 
tol209940.] [NT: alternate gene name: yixC; similar to hypothetical] [SP:P38049] 
[LE:83202] [RE:83702] [DI : complement] >gp : [GI : e3 25006 :g2 226228] [LN : BSY14083] 
[AC:Y14083] [PN : Hypothetical protein] [GN:yixC] [OR:Bacillus subtilis] 
[DB: genpept-bctl] [DE: Bacillus subtilis chromosomal DNA, region 76-78 degrees: 
betweenglyB-aprE. ] [NT: See Swiss Prot P38049; YIXC_BACSU] [SP:P38049] [LE:3061] 
[RE: 3561] [DI : complement] 



ORF Name 



NTID 



NT AA 
AAID — — score 
LengthLength 



probability 



^7503000993 16835053 c2 758 



Description 
NO-HIT 



[2167 | [5939 |P^~| |S5 | 



ORF Name 



NT 



AA 



NTID AAID — , — 



LengthLength 



score 



LL7503000993 16994043 c3 933 



] 



[2T51T 



[5940 ||ii28||375 |jgl" 



probability 
10.0062 



Description 

pir: [LN:E69106] [AC:E69106 ] [PN : hypothetical protein MTH1793] [GN:MTH1793 ] 
[OR:Methanobacterium thermoautotrophicum] [DB:pir2] >gp : [GI :g2622925] [LN: AE000934] 
[AC:AE000934 :AE000666] [PN:unknown] [GN:MTH1793] [OR :Methanobacterium 
thermoautotrophicum] [DB : genpept -bct2 ] [DE :Methanobacterium thermoautotrophicum from 
bases 1640298 to 1655421 (section 140 of 148) of the complete genome.] [NT: Function 
Code: 14. 00 - Unknown, ; similar to,] [LE:89] [RE: 790] [DI : complement] 



ORF Name 



NTID 



NT AA 
AAID — -i — , , score 
Length Length 



probability 



A17b03000993_189437_tl_34 

Description 
NO-HIT 



j [2169 | |594i |47 | 



500 



